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The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specification 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 

4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


Brief Reports 


author’s lines, must not exceed 85 lines av- 
eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style of 
the 1957 revision of the APA Publication 
Manual. Headings, tables, and references are 
avoided or, if essential, must be counted in 
the 85 lines. Each Brief Report must be ac- 
companied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 85-line quota: 


1An extended report of this study may be ob- 
tained without charge from John Doe (giving the 
author’s full name and address) or for a fee from 
the American Documentation Institute. Order Docu- 
ment No. from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$— for microfilm or $— for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (a) The extended report 
should be typed with single spacing for 
economy in duplication. (6) Tables and fig- 
ures should be placed adjacent to the text 
which refers to them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual. 
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DEPENDENCY AND THE REINFORCEMENT VALUE 
OF A VERBAL STIMULUS 


ROBERT B. CAIRNS ann MICHAEL LEWIS 


Indiana University 


Generalized reinforcers in the form of at- 
tention, approval, and affection are believed 
to play a role of nuclear importance in the 
maintenance and control of social behavior 
(Dollard & Miller, 1950; Skinner, 1953). 
Studies of verbal conditioning, in particular, 
have been often cited in support of the no- 
tion that simple expressions of attention-as- 
sent constitute a reinforcing state of affairs. 

In view of the inconclusive results of sev- 
eral verbal conditioning studies (Krasner, 
1958), it seems pertinent to question whether 
the reinforcement stimulus used in these stud- 
ies is as invariably reinforcing as it is some- 
times purported to be. Indeed, by no means 
is it clear, either empirically or theoretically, 
that such experimenter behavior as a headnod 
and verbalization of “mm-hmmm” or “that’s 
a good one,” should be equally reinforcing 


for all persons. On the contrary, as Child 
(1954) has suggested, to the extent that the 
reinforcement value of such events as praise 
and approval is learned, one might expect 
there to be marked differences among persons 
in their evaluation of such social reinforce- 
ment. 


One experimental study is particularly rele- 
vant to this expectation. Mandler and Kaplan 
(1956) found that the reinforcing stimulus 
“mm-hmmm” seemed to have little effect on 
the subjects’ production of plural nouns over 
successive reinforcement periods. subse- 
quent interview revealed that some of the 
subjects felt the reinforcing stimulus had a 
negative meaning: these subjects thought it 
meant they were going too fast, giving the 
wrong kinds of words, or otherwise misinter- 
preting the experimenter’s instructions. When 
the data from the two groups were separately 
analyzed, it was found that the subjects who 
evaluated the reinforcing stimulus as a posi- 
tive event showed some evidence of condi- 
tioning in the initial reinforcement period. 
On the other hand, those subjects who gave 


a negative interpretation to the stimulus 
tended to show a decrease in the number of 
plural nouns they produced during the rein- 
forcement period. 

Theoretical statements by Sears and his col- 
laborators (Sears, Maccoby, & Levin, 1957; 
Sears, Whiting, Nowlis, & Sears, 1953) re- 
garding dependency motivation would appear 
to provide an account for such systematic dif- 
ferences among individuals in their response 
to social reinforcement. According to this 
group, dependency refers to a learned mo- 
tivational system, the behavioral manifesta- 
tions of which are help seeking, attention 
seeking, and approval seeking. Following this 
conceptualization of dependency, it seems rea- 
sonable to expect, with Gewirtz and Baer 
(1958), that approval and affection would 
have enhanced reinforcement value for a per- 
son with strong dependency needs. Thus, if 
a highly dependent individual were placed in 
a situation where approval was contingent on 
his behavior, one would predict that he would 
be more likely than a markedly less depend- 
ent person to produce responses that would 
be approved. 

Recent experimental evidence has provided 
some support for this expected relationship 
between dependent behavior and _ response 
to social reinforcement. Studying the effect 
of mild praise on a child’s performance on 
learning lists of paired-associate items, Cairns 
(1959) found that children who were ob- 
served as being highly dependent in their so- 
cial interactions (ready to seek help) showed 
significantly greater improvement in learning 
under praise than did children who mani- 
fested little dependent behavior. Similar find- 
ings have been reported recently by Ends- 
ley and Hartup (1960). Comparing the per- 
formance of groups of preschool children who 
differed in their tendency to seek praise from 
the teacher, these investigators found that the 
most dependent children persisted longer than 
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did the least dependent group on a simple, 
repetitive motor task where persistence was 
associated with social reinforcement. 

While the foregoing studies provide sup- 
port for the hypothesized relationship be- 
tween dependency and social responsivity, it 
would seem desirable to establish whether an 
individual’s subjective evaluation of the re- 
inforcement stimulus is a function of varying 
levels of dependency motivation. The present 
study was designed to investigate this prob- 
lem, and to explore further the relationship 
between dependency and verbal conditioning. 

On the basis of the above considerations, 
the following hypotheses were developed: 

1. In a verbal conditioning situation, the 
High Dependent subjects would give a higher 
positive evaluation to the reinforcement stimu- 
lus than would the Low Dependent subjects. 

2. The High Dependent subjects would 
condition to a significantly higher level, i.e., 
produce more conditioned responses in the 
final block of reinforced trials, than would 
Low Dependent subjects. 

3. Subjects who judge the reinforcement 
stimulus to be a positive event would condi- 
tion to a higher level than would subjects 
who assign a negative rating to the stimulus. 


METHOD 
Subjects 


A total of 60 male college freshmen were studied. 
From an administration of the Edwards (1953) Per- 
sonal Preference Schedule (PPS) to 625 male fresh- 
men students at the University of Pennsylvania, 60 
men who represented extreme scorers on the defer- 
ence and autonomy scales were selected for final 
study.1 The 30 subjects who scored high on defer- 
ence (> 70th percentile) and low on autonomy 
(< 50th percentile) were classified as High Depend- 
ent. Likewise, those 30 subjects high on autonomy 
(> 75th percentile) and low on deference (< 50th 
percentile) were considered to be Low Dependent. 
A previous validation study by Bernardin and Jessor 
(1957) indicates that this combination of scores on 
the PPS is significantly related to such manifest de- 
pendent behavior as help seeking and quality of 
performance under conditions of verbal punishment. 
Further indirect or construct validation of the scales 
is provided by the differential response of the High 
and Low Dependency subjects to our recruitment 
procedures. Each of the university freshmen who 


1The authors wish to express their appreciation 
to Marvin Hurvich for making these PPS records 
available. 
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met our criteria for selection received a letter, then 
a follow-up postcard requesting their participation 
in the study. Because PPS scores were available on 
all potential subjects, it was possible to compare the 
group who appeared with the group who did not 
respond. Such a comparison reveals that significantly 
more High Dependent subjects came in for an ap- 
pointment (x* = 4.61; p< .05). 


Measures of Dependency 


Two additional dependency measures, the Inter- 
personal Checklist ,(ICL) and Behavioral Depend- 
ency (BD) test were administered to each subject. 
The ICL, devised by Leary and his associates (1957), 
is an inventory consisting of eight clusters of 16 self- 
referent items. For the purposes of this study, only 
one cluster was scored: Octant VI, consisting of ad- 
jectives and descriptive phrases that Leary classifies 
as “Docile-Dependent” items. The ICL dependency 
score was the absolute number of items that the sub- 
jects checked in Octant VI, including such responses 
as “likes to give in,” “dependent,” “very anxious to 
be approved of,” etc. 

Since both the PPS and ICL are self-description 
inventories, it seemed desirable to obtain a measure 
which more directly reflected manifest dependent 
behavior. For this purpose, each subject was pre- 
sented with a nearly unsolvable puzzle task designed 
to assess his readiness to ask for assistance when 
confronted with an extremely difficult problem solv- 
ing situation. The task is described in an earlier pa- 
per (Cairns, 1959). The BD score was a response 
latency measure, ie., the length of time the subject 
worked on the puzzle before making a direct plea 
for help. If the subject did not request help within 
15 minutes, the experimenter volunteered to assist. 
Those subjects refusing help at this point were al- 
lowed to continue an additional 3 minutes before 
the experimenter discontinued testing. 


The Conditioning Task 


A procedure similar to that employed by Sarason 
(1958) was followed in the design of the condition- 
ing task. The stimulus material consisted of 149 
3” X 5” cards on which four words were typed: be- 
low “He” appeared three verbs in the third person 
singular form. Since 60 cards were repeated, the sub- 
jects had a total of 209 trials on the conditioning 
task. In the instructions the subjects were asked to 
construct a simple sentence for each card, using the 
pronoun and one of the three words that were typed 
below. For the experimental subjects, the experi- 
menter initiated the reinforcement conditions on the 
11th occurrence of the response of the class to be 
conditioned, and continued reinforcing every such re- 
sponse that occurred during the presentation of the 
remaining cards. The reinforcing stimulus consisted 
of a glance up at the subject and a murmured 
“mm-hmm.” For the control subjects, the experi- 
menter was silent during the presentation of the 
cards. 
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The verbs appearing on the cards were roughly 
equated in terms of frequency of occurrence, as de- 
termined by the Thorndike-Lorge (1944) frequency 
count. On each card, one of the three words had 
strong aggressive connotations, a second had high 
dependency connotations, and a third was neutral. 
The words were selected for inclusion in either the 
aggressive or dependent class on the basis of ratings 
by 40 judges (30 undergraduate and 10 graduate 
students). Verbs which were rarely rated as being 
either dependent or aggressive constituted the neutral 
category. It was thus possible to construct 149 triads, 
each of which contained an item from the three re- 
sponse classes. 

Twenty subjects, 10 High Dependent and 10 Low 
Dependent, were reinforced for constructing aggres- 
sive sentences; another 20 subjects were reinforced 
for emitting dependent words. Of the remaining 
group, 10 High Dependent and 10 Low Dependent 
individuals served as control subjects. The subjects 
were randomly assigned to the experimental treat- 
ment groups. 


Evaluation of the Verbal Stimulus 


Since every subject was aware that the experi- 
menter said “mm-hmmm” from time to time, it was 
possible to obtain an evaluation of the verbal stimu- 
lus from every experimental subject. Each subject 
was presented with an 11-point scale, ranging from 
—5.0 to +5.0, with instructions to judge the pleas- 
antness of the “mm-hmmm.” The rating appeared to 
be an easy one for most subjects, and was usually 
made with little hesitation. 

A male graduate student in his early twenties ad- 
ministered the verbal conditioning task, which was 
introduced under the guise of a reading test stand- 
ardization. Following the conditioning period, the ex- 
perimenter excused himself with the explanation that 
another experimenter, engaged in a different project, 
would like to see the subject. At that time, the sec- 
ond experimenter administered the BD test and ICL 
according to standardized instructions. Following the 
subject’s completion of the ICL, the first experi- 
menter returned to obtain the subjective evaluation 
of the verbal stimulus. 

Neither the first nor the second experimenter was 
aware of the subject’s dependency classification at 
the time the experiment was conducted. Assignment 
of the subjects to the various experimental condi- 
tions was made by a third person. 


RESULTS 
Agreement between the Dependency Measures 


The results show that the dependency meas- 
ures were moderately related to each other. 
When the two dependency groups, as selected 
by the PPS, were compared on their ICL re- 
sponses, it was found that the High Depend- 
ent subjects checked significantly more Octant 


TABLE 1 
COMPARISON OF HIGH DEPENDENT AND Low DEPEND- 
ENT GROUPS WITH RESPECT TO EVALUATIONS OF 
THE VERBAL STIMULUS 


Evaluation 


Non- 


Group positive Positive Total 


High dependent 8 12 20 
Low dependent 16 4 20 


Total 24 16 40 
Fisher’s exact p = .02 
(two-tailed) 


VI items than did the Low Dependent sub- 
jects (¢ = 3.70; p < .001). The product-mo- 
ment correlation between the PPS deference 
scale and the checklist score is r= .44 (p 
< .005). A comparison of the performance 
of the two groups on the BD measure in- 
dicates that the High Dependent subjects 
tended to show somewhat shorter response 
latencies than did the Low Dependent group 
(¢=1.72; 05<p<.10). It should be 


noted that the High Dependent group tended 


to have more subjects who asked for help be- 
fore the end of the first 15-minute period, 
as well as fewer who refused assistance until 
it was necessary to remove the puzzle from 
them. A similar positive relationship was 
found between the ICL dependency score 
and the BD task, with the group of subjects 
who checked the fewest items in Octant VI 
requiring a significantly longer time before 
asking for help (t = 2.26; p < .05). 


Prediction I: Subjective Evaluation of the 
Verbal Stimulus 


The expectation that the High Dependent 
and Low Dependent subjects would differ 
in their subjective evaluation of the verbal 
stimulus was clearly supported by these data 
The 40 experimental subjects were separated 
into two groups on the basis of their inter- 
pretation of “mm-hmmm”; 16 subjects gave 
a positive evaluation to the stimulus and 24 
subjects considered the stimulus to be either 
negative or neutral. The resulting 2 x 2 table 
is presented in Table 1. The extent to which 
the obtained cell frequencies deviate from the 
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TABLE 2 


MepDIAN NuMBER OF TRIALS FOR HIGH DEPENDENT AND Low DEPENDENT GROUPS 
TO REAcH TEN OccURRENCES OF REINFORCED RESPONSE CLASS 


Prerein- 
forcement 
series 


Group 


Final rein- 
forcement 
series 


Change score 
index of 
conditioning 


High Dependent group 
Aggressive 
responses (V = 10) 
Dependent 
responses (V = 10) 
Total (NV = 20) 
Low Dependent group 
Aggressive 
responses (V = 10) 
Dependent 
responses (NV = 10) 


Total (NV = 20) 


TABLE 3 


RESULTS OF THE COMPARISONS OF THE HIGH DEPEND-— 
ENT AND Low DEPENDENT GROUPS IN RESPONSE 
TO THE VERBAL STIMULUS 


Mann- 
Whitney 


Comparison groups U value 


Prereinforcement series 


HD (dep) vs. LD (dep) 
HD (agg) vs. LD (agg) 
HD (dep) vs. HD (agg) 
LD (dep) vs. LD (agg) 
HD (dep + agg) vs. LD (dep + agg) 


36.0 
49.0 
20.0* 
41.5 
166.5 


Final reinforcement series 


HD (dep) vs. LD (dep) 
HD (agg) vs. LD (agg) 
HD (dep) vs. HD (agg) 
LD (dep) vs. LD (agg) 
HD (dep + agg) vs. LD (dep + agg) 


36.5 
18.0* 
49.5 
36.0 
107.0* 


Change score index 


HD (dep) vs. LD (dep) 
HD (agg) vs. LD (agg) 
HD (dep) vs. HD (agg) 
LD (dep) vs. LD (agg) 
HD (dep + agg) vs. LD (dep + agg) 


47.0 
19.5* 
32.0 
35.5 
140.5* 


Note.—HD High Dependent group. LD Low Dependent 
group. (dep) reinforcement followed dependent word. (agg) 
reinforcement followed aggressive word. 

*p <.05; two-tailed. 


expected values is reflected by the Fisher 
probability estimate based upon the total 
table. It should be noted that the significance 
appears to be due primarily to the tendency 
of the Low Dependent subjects to judge the 
stimulus as being a nonpositive event. This 
trend among the Low Dependent subjects to 
avoid a positive rating, when tested directly 
by the binomial expansion, proved to be sub- 
stantial (p < .05). 

A test of the relationship between the ICL 
measure of dependency and the subjective 
evaluation of the reinforcement stimulus 
yielded similar results. The product-moment 
correlation between the ICL dependency 
score and the subjective ratings was small, 
but significant in the expected direction 
(r = 33; p< .05). The corresponding cor- 
relation between the puzzle scores and the 
evaluations of the verbal stimulus was, how- 
ever, nonsignificant (r = .23; p > .05). 


Prediction II: Dependency and Conditioning 


According to the second prediction, it was 
expected that the High Dependent group 
would show a significantly higher level of 
conditioning than would the Low Dependent 
group. In Table 2 are reported the median 
number of trials required by the various de- 


33.0 26.0 +6.5 7 
23.0 25.0 40.5 7 
25.0 25.0 +4.5 i 
29.5 39.0 —5.5 
27.5 35.5 +0.5 i" 
28.5 38.5 
| 
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pendency groups to attain 10 occurrences of 
the class of responses which were reinforced. 
The number of trials for two series are pre- 


sented: (a) the prereinforcement series, i.e., - 


the first 10 occurrences of the appropriate 
response class; (6) the final reinforcement 
series, i.e., the last 10 occurrences of the 
reinforced response. In addition, the change 
score index of conditioning is reported. 
Derived by subtracting, for each subject, the 
number of trials in the final reinforcement 
series from the number of trials in the pre- 
reinforcement series, the change scores repre- 
sent the most accurate estimate of the effect 
of the reinforcement procedures. 

The results of the comparisons among the 
dependency subgroups are shown in Table 
3. In the first, nonreinforced series, the High 
Dependent subjects produced words con- 
noting dependency more readily than they 
produced aggressive words. The correspond- 
ing difference among the Low Dependent 
subjects did not approach significance. 

In the final reinforcement series, the clear- 
est difference to emerge was that between 
the two groups reinforced for producing 
aggressive sentences. Whereas the High 
Dependent subjects tended to increase their 
rate of reinforced responses, the Low De- 
pendent group showed a significant decrement 
below its performance in the prereinforcement 
series. This negative change of the Low 
Dependent group, as tested by the Wilcoxon 
matched-pairs test, proved to be statistically 
significant (p< .05). There was, however, 
no substantial difference between the change 
scores of the High and Low Dependent sub- 
groups which were reinforced for producing 
words with dependency connotations. Thus, 
when the overall comparison between the 
High Dependent and Low Dependent groups 
is tested, irrespective of class of response 


reinforced, the obtained difference just fails 
to reach the predetermined level of sig- 
nificance. 

Comparable trends were present in the 
relationship between the ICL dependency 
scores and amount of change observed in the 
conditioning situation. The product-moment 
correlation between the change score index 
of conditioning and the ICL dependency 
measure was small, but was in the predicted 
direction (r= .30; p< .05). The change 
score index of conditioning was, however, 
unrelated to performance on the BD task 
(r= .11; p> .10). 


Prediction III: Conditioning as a Function 
of the Evaluation of the Verbal Stimulus 


According to the final prediction, it was 
expected that a positive relationship would 
be found between the subjects’ evaluations 
of the “mm-hmmm” and the level of con- 
ditioning attained. To evaluate this hypothe- 
sis, the dependency measures were disre- 
garded and the positive and negative evalu- 
ators were compared in terms of their verbal 
conditioning performance (see Table 4). 
While medians for the two groups did not 
differ significantly in the prereinforcement 
series, the differences were clearly significant 
in the final block. Similarly, a comparison of 
the change scores indicates that the inter- 
action between the two groups approached 
significance. To explore further the magnitude 
of this relationship, the change scores were 
correlated with the subjective evaluations of 
the stimulus. The obtained product-moment 
correlation indicates that the relationship was 
only a moderate one (r = .35). 


DISCUSSION 


This study represents one phase of a re- 
search program, the general purpose of which 


TABLE 4 


MEDIAN NuMBER OF TRIALS FOR POSITIVE AND NONPOSITIVE EVALUATORS TO 


REACH TEN OCCURRENCES OF REINFORCED RESPONSE CLASS 


Prereinforcement 
Evaluation series 
Positive (N = 16) 
Nonpositive (V = 24) 


24.5 
30.0 


Final reinforcement 
series 


Change score index 
of conditioning 
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TABLE 5 


RESULTS OF THE COMPARISONS OF PosITIVE AND Non- 


POSITIVE EVALUATORS IN RESPONSE TO VERBAL 


Ropert B. CarRNs AND MICHAEL LEWIS 


STIMULUS 
Comparison Mann-Whitney 
series U value p* 
Prereinforcement 146.0 >.10 
Final 90.0 <.01 
Change 129.0 <.10 


* Two-tailed tests. 


is to investigate the motivational factors 
involved in determining responsivity to ex- 
ternal evaluation. The data reported in this 
paper have provided, for the most part, 
preliminary support for the notion that an 
individual’s response to social reinforcement 
is related to aspects of his dependency 
behavior. 

Perhaps the most significant single finding 
of this study was the obverse response of the 
Low Dependent subjects to the reinforce- 
ment procedures. Not only did the Low 
Dependent subjects assign significantly fewer 
positive ratings to the stimulus, but these 
subjects showed a nontrivial decline in the 
production of the class of responses rein- 
forced. Furthermore, this “negative con- 
ditioning” of the Low Dependent subjects 
appeared to account for most of the difference 
between the High and Low Dependent groups, 
i.e., the High Dependent group showed only 
minimal evidence of the expected verbal con- 
ditioning phenomenon. 

It should be noted that, as a group, the 
Low Dependent subjects tended to deny any 
form of dependency behavior, whether on 
the self-report inventories or in the limited 
social setting of the Behavioral Dependency 
test. This failure to behave in a dependent 
fashion, even under conditions where a de- 
pendent response would seem to be the most 
appropriate one, suggests that, for these sub- 
jects, dependency behaviors are strongly in- 
hibited. If this notion is correct, the find- 
ing of a decrement in conditioned responses 
among the Low Dependent subjects becomes 
more understandable. Insofar as experimenter 
attention and approval constitute cues for 
dependency, these events would be aversive 


stimuli for a person who attempts to inhibit 
dependency behavior, and presumably would 
lead him to avoid responses which elicit de- 
pendency cues. Such was the case with the 
Low Dependent subjects, who both evaluated 
the “mm-hmmm” to be a nonpositive stimu- 
lus and tended to avoid responses that would 
lead to a recurrence of that stimulus. Thus, 
rather than consider these subjects as simply 
lacking in dependency instigation, or having 
low dependency needs, it seems that their 
behavior might more appropriately be viewed 
as an active inhibition of dependency 
strivings. 

Regarding the relationship between the 
dependency measures and the subjects’ sub- 
jective evaluations, it might be argued that 
the High Dependent subjects tended to rate 
the “mm-hmmm” as being a positive event 
simply because they wanted to avoid any 
behavior that might be interpreted as criti- 
cism of the experimenter. Accordingly, a 
positive rating might be obtained whether or 
not the High Dependent subjects had sub- 
jectively interpreted the verbal stimulus to 
be rewarding. Although such a'response set 
was not investigated in this study, it seems 
entirely possible that something akin to “yea- 
saying” did play a role in the ratings. However, 
the finding of a significant relationship be- 
tween the ratings and the subjects’ production 
of conditioned responses seems to indicate 
that the ratings reflected, to some extent, the 
subjects’ actual subjective evaluation of the 
verbal stimulus. 

While most of the experimental data pre- 
sented here can be accounted for satisfacto- 
rily in the dependency framework, it should 
be noted that one of the findings of this study 
is more difficult to interpret; namely, the 
failure of the experimental subjects to show 
evidence of strong positive conditioning. Even 
subjects who assigned a positive evaluation 
to the verbal stimulus did not show a sig- 
nificant increment over the prereinforcement 
level. Previous reports (Krasner, 1958) indi- 
cate that such factors as directness of the 
reinforcing stimulus, personal characteristics 
and prestige of the experimenter, and the 
nature of the response reinforced influence 
the absolute level of conditioning which sub- 
jects attain. A review of the present design 
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indicates that each of these factors may have 
been operative in our experimental situation, 
e.g., young experimenter reinforcing emotion- 
ally charged words with minimal approval. In 
particular, the nature of the response cate- 
gory reinforced seemed highly relevant for 
the present study. Those groups reinforced 
for aggressive responses tended to show evi- 
dence of a conditioning phenomena, whereas 
there was slight conditioning among subjects 
in the groups reinforced for producing de- 
pendency words. In this regard, it should be 
noted that the judges who selected the words 
initially tended to show considerably greater 
agreement among themselves in_ selecting 
words for the aggression category than they 
showed in selecting dependency words. 

Finally, a comment regarding the relative 
predictive power of the three types of de- 
pendency measures seems appropriate. On the 
basis of previous results (Cairns, 1959), it was 
expected that the behavioral measure would 
be a relatively effective predictor of behavior 
in the conditioning situation. This expecta- 
tion obviously was not borne out by the 
results. Whereas both self-description meas- 
ures were related to the subjective evaluations 
of the reinforcing stimulus as well as to the 
amount of conditioning, the Behavioral De- 
pendency scores were significantly related to 
neither. A re-evaluation of the puzzle material 
suggests that part of the difficulty stems from 
scoring simply in terms of response latency. 
When the data were analyzed using a refined 
scoring scheme that takes into account both 
latency and the nature of the comments the 
subject makes when he accepts help, some- 
what clearer trends in the predicted direction 
are found. Even here, however, the relation- 
ships between the puzzle scores and the de- 
pendent variables fail to reach an acceptable 
level of significance. 


SUMMARY 


The purpose of this investigation was to 
explore the influence of dependency motiva- 
tion on the effectiveness of social reinforce- 
ment. Sixty freshman males were separated 
into High and Low Dependent groups on the 
basis of their responses to the Edwards Per- 
sonal Preference Schedule. 

To test the general hypothesis that ex- 


REINFORCEMENT 


pressions of assent-approval would have dif- 
ferential reinforcement value for High and 
Low Dependent individuals, 20 subjects from 
each group were placed in a conditioning 
situation where a selected class of responses 
was reinforced. The remaining 20 subjects 
served as a control group. Following the con- 
ditioning procedures, subjective evaluations 
of the reinforcement stimulus were obtained. 
All subjects were administered two additional 
dependency measures, the Interpersonal 
Checklist and a behavioral sample of the 
subjects readiness to behave in a dependent 
manner. These three measures of dependency 
were shown to be moderatly interrelated. 

According to the first prediction, the High 
Dependent subjects were expected to assign a 
higher positive value to the reinforcement 
stimulus than would the Low Dependent 
group. While both inventory measures of 
dependency supported this prediction, the 
behavior sample failed to yield significant 
results. Secondly, it was expected that the 
High Dependent subjects would show a sig- 
nificantly higher level of conditioning than 
would the Low Dependent subjects. This pre- 
diction was only partially supported. The 
differences that were obtained were due pri- 
marily to the sharp decrement in performance 
by the Low Dependent subjects rather than 
to evidence of conditioning in the High De- 
pendent subjects. 

The final prediction stated that the sub- 
jects’ evaluation of the reinforcement stimulus 
would be positively related to the amount of 
conditioning that occurred. This expectation 
was clearly supported by the data with those 
subjects who gave negative evaluations to the 
reinforcement stimulus showing a decrement 
in the production of reinforced responses. 
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COMMENT ON LEVI’S “ORTHOPEDIC DISABILITY AS 
A FACTOR IN HUMAN-FIGURE PERCEPTION” 


A. B. SILVERSTEIN 
Pacific State Hospital 


On the basis of findings recently reported 
in this journal, Levi (1961) claimed support 
for a hypothesized one-to-one relationship be- 
tween the physical body and the body image. 
It is the present writer’s contention that Levi 
did not present sufficient data to justify her 
claim. The specific hypotheses of Levi’s study 
were that “subjects with disability of the legs 
will be unusually sensitive to the legs in draw- 
ings of the human figure and that subjects 
wth disability of the arms will be unusually 
sensitive to the arms in drawings”; also, that 
a group of back disabilities “will show a 
greater resemblance to the characteristics of 
a control group of nondisabled” (p. 253). 

Using an ingenious methodology, Levi 
found that subjects with disability of the legs 
were indeed significantly more sensitive than 
control subjects to the legs of stick figures, 
and that subjects with disability of the arms 
were significantly more sensitive than con- 
trols to the arms of the figures. Moreover, 
the group of back disabilities did not differ 
significantly from the controls in sensitivity 
to the bodies of the figures. These results are 
entirely consistent with Levi’s specific hy- 
potheses, but in themselves they do not con- 
stitute a decisive test of her general hy- 
pothesis. 


Specifically, Levi failed to provide answers 
to the following critical questions: Were sub- 
jects with disability of one part of the body 
significantly more sensitive to that part of the 
body than to other parts? Were they sig- 
nificantly more sensitive than controls to that 
part of the body only? It is at least theoret- 
ically possible that, for example, subjects with 
a disability of the legs were no more sensitive 
to the legs than to the arms, or that they were 
significantly more sensitive than controls to 
the arms, as well as to the legs. If so, the 
results would not support the general hy- 
pothesis. 

The data which Levi did present, coupled 
with considerations of her method, make such 
a possibility appear rather unlikely, but they 
do not completely rule it out. In claiming that 
her findings support the hypothesis of a one- 
to-one relationship between the physical body 
and the body image, Levi has a responsibility 
to take into account ail of her data. 
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The distinction that Silverstein makes be- 
tween my general hypothesis and my specific 
hypotheses is more apparent than real. If 
arm-disabled subjects prove to be “unusually 
sensitive to the arms in drawings” (Levi, 
1961, p. 253), it means that they are more 
sensitive to arms than to other parts. 

In the event that the pattern of body-part 
choices is of interest to readers, a complete 
list of mean choices for all three experimental 
groups is offered. 

Silverstein’s second question is not really 
different from his first. They both ask whether 
the whole level of perceptual sensitivity to 
all parts of the body is higher in disabled 
than in normal persons. This problem is ir- 
relevant to my study. The forced-choice aspect 
of my method sacrifices information about 
overall level of sensitivity for the sake of 
clarifying differences among kinds of sensi- 
tivity. 


REPLY TO SILVERSTEIN’S “COMMENT” 


AURELIA LEVI 
Albert Einstein College of Medicine, Yeshiva University 


TABLE 1 


MEANS OF THREE DisApiLity Groups 


Group Mean 


Group A choices: 


Arms 14.17 

Legs 5.0 

Back 7.92 
Group L choices: 

Arms 7.61 

Legs 11.23 

Back 8.15 
Group B choices: 

Arms 9.15 

Legs 4.92 

Back 12.92 
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REINFORCEMENT IN A THERAPY-LIKE SITUATION 
THROUGH SELECTIVE RESPONDING TO FEELINGS 
OR CONTENT* 


IRENE E. WASKOW 2 


University of Maryland School of Medicine 


This study is concerned with a simple type 
of learning that may occur early in psycho- 
therapy. The general hypothesis is that a 
therapist may, by selective responding, rein- 
force those verbalizations that he considers 
important. The specific hypothesis being 
tested is that selective therapist responding 
will function as a reinforcer of the feeling or 
content aspects of a patient’s communications. 

Although there are many references in the 
theoretical literature to learning that occurs 
during psychotherapy (see Shoben, 1948, 
1953 for review), only two researchers have 
rigorously studied events in therapy within a 
learning framework. Murray (1956) demon- 
strated increases or decreases in content areas 
which were encouraged or discouraged by a 
therapist and Dittes (1957) interpreted his 
results as extinction of fear connected with 
making embarrassing sex statements in 
therapy. Both of these researchers obtained 
their data from recordings of normally con- 
ducted psychotherapy sessions. In order to 
relate clearly the behavior of the therapist to 
the learning of the patient, in the present 
study the therapist’s behavior was manipu- 
lated and its effects on patient behavior in- 
vestigated. 

Manipulation of the experimenter’s be- 
havior and the study of its effects on the 
subject’s learning have been commonly em- 
ployed by experimental psychologists. Of 
special relevance here are studies of verbal 
conditioning, in which the experimenter at- 


1 This paper is based on a PhD dissertation at the 
University of Wisconsin. The author is indebted to 
Barclay Martin under whose supervision the re- 
search was conducted, to David Grant and Carl 
Rogers who served on the research committee, and 
to Vilma Ginzberg and Tom Tomlinson who served 
as raters. 

2 Formerly a Postdoctoral Research Fellow at the 
National Institute of Mental Health, Bethesda, 
Maryland. 


tempts, through selective responding, to re- 
inforce a particular class of verbal responses. 
Most of these studies (see Krasner, 1958, for 
review) have demonstrated an increase in the 
frequency of the reinforced response class. 
Many writers have compared the procedures 
in these studies to the events that take place 
in therapy, and Krasner (1955) has specifi- 
ally emphasized the value of using the verbal 
conditioning paradigm to investigate vari- 
ables in psychotherapy. The inference here 
is that the therapist may selectively respond 
to particular patient verbalizations. The ex- 
perimental conditions in most studies of ver- 
bal conditioning differ radically, however, 
from those which obtain in psychotherapy. 
For example, many of these studies have 
utilized only simple responses of “umhmm” 
or “good” as reinforcing stimuli; the ver- 
balizations reinforced have frequently been 
grammatical categories such as plural nouns 
or the choice of “I” or “we” from among six 
pronouns in beginning a sentence; and the 
experiments have generally been conducted in 
an experimental laboratory situation. 

A few studies have approached the therapy 
situation more closely, by using more complex 
categories of reinforcing behavior, e.g., agree- 
ment with or paraphrasing of a response 
(Verplanck, 1955), or by reinforcing response 
classes more relevant to therapy, e.g., hostile 
responses (Binder, McConnell, & Sjoholm, 
1957; Buss & Durkee, 1958), family as op- 
posed to nonfamily memories (Quay, 1959), 
and self-references (Adams & Hoffman, 1960; 
Rogers, 1960). Several experiments reported 
since this study was undertaken have specific- 
ally dealt with the reinforcement of “feeling” 
responses. Levine (1958) attempted to rein- 
force expressions of feeling in fairly free in- 
terview situations, and Salzinger and Pisoni 
(1958, 1960) conditioned affect responses in 
both normals and schizophrenics in a situation 
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closely resembling a traditional diagnostic 
interview. 

In the present study, the verbal condition- 
ing paradigm was used in a laboratory situa- 
tion as closely as possible resembling psycho- 
therapy. Accordingly, the experimenter in- 
troduced the study to student volunteers as 
a therapy-like experience. The class of rein- 
forcing stimuli was reflection of certain as- 
pects of the subjects’ communications, a tech- 
nique utilized at times by many therapists. 
It was hypothesized that such reflection would 
fulfill Skinner’s (1953) criteria for a rein- 
forcing stimulus. The response categories re- 
inforced, the feeling as opposed to the content 
aspects of subjects’ communications, have 
been emphasized in both theoretical and re- 
search writings of client centered orientation 
(Rogers, 1942; Seeman, 1954), and are of 
relevance to the widely held view that ex- 
pression of previously punished feelings in 
the presence of a nonpunishing therapist is 
essential for new learning to occur in therapy 
(see, e.g., Dollard & Miller, 1950). 


METHOD 


The therapist selectively responded to different 
types of material in each of three experimental 
groups. In one group (Group F), she responded to 
references to the subjects’ feelings and attitudes, in 
another (Group C), to the descriptive and intellectual 
content of the communication, and in a third group 
(Group FC), to a combination of feelings and 
content. 


Subjects 


Thirty-six subjects were used, 12 in each of the 
experimental conditions. The experimenter asked 
students in discussion sections of an introductory 
psychology course to serve in the experiment for 
points which might improve their grades. They were 
told that the experiment involved research in psy- 
chological interviewing and psychotherapy and they 
were asked to volunteer only if they were interested 
in and felt that they could profit from a therapy- 
like experience. This procedure was used in order 
to obtain subjects at least minimally motivated for 
psychotherapy. Of the volunteers only those who 
had scores in the upper two-thirds of a scale of 
manifest anxiety (Heineman, 1953) were asked to 
come for preliminary interviews, since topics they 
would deal with might more closely resemble therapy 
material. A volunteer who was currently in psy- 
chotherapy was dropped. 
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Preliminary Interview * 


In the preliminary interview, lasting 15 minutes, 
the experimenter first assured subjects of the con- 
fidentiality of the tape recordings and then in- 
structed them to: 


... pretend that you are having your first visit 
at a counseling center and .. . discuss things that 
you think you might talk about if you had gone 
to such a center for help, telling something about 
yourself and about matters that you have been 
thinking about or that have been troubling you, 
and so on. 


Subjects were asked to talk into the microphone, 
and the experimenter then left the room, so that 
there could be no accidental differential reinforce- 
ment during this preliminary interview. 

The experimenter later listened to the taped inter- 
views and assigned each a global rating on a seven- 
point scale, in which a rating of 1 was given to in- 
terviews with major emphasis on the feelings of the 
subject, 4 to interviews with equal emphasis on 
feelings and content, and 7 to interviews with 
major emphasis on content. The position of the 
subjects was largely toward the content end of the 
scale. The 36 subjects were ranked according to 
these ratings and then assigned to twelve triplets, 
ranks 1-3 in one triplet, ranks 4-6 in the next, etc. 
Subjects within each triplet were randomly assigned 
to the three conditions, with the restriction that 
each group contain 10 of the 30 female subjects and 
two of the six males. With subjects in the three 
groups matched for initial level of expressing feeling 
and content, any differences among the groups due 
to differential reinforcement might be expected to 
emerge clearly. 


Experimental Sessions 


Each subject was seen for four experimental in- 
terviews, two a week for 2 weeks. These sessions 
extended over a 6-week period, with four matched 
triplets seen during each consecutive 2 weeks. The 
experimenter functioned as therapist and conducted 
all of the 30-minute interviews. All interviews were 
tape recorded. At the beginning of the first inter- 
view, subjects were told to consider this their first 
session and to talk about the same types of things 
they were asked to discuss during the preliminary 
interview, the only difference being the presence of 
the therapist. If possible, no further instructions 
were given throughout the interviews. At the begin- 
ning of each, the therapist said she would like to 
continue as before. Occasionally, during an inter- 
view, a subject could not proceed. Only general 
suggestions were made at such times, such as “Any 

3GSRs were recorded for all interviews, but will 
not be discussed in this article. Although the thera- 
pist’s impression was that subjects were generally 
not very concerned about the GSR recordings, there 
is no data to support this impression and it is pos- 
sible that they may have influenced the results. 
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other things you’ve been thinking about?” and 
“Maybe something that’s troubled you in the past?” 
Such suggestions occurred only once or twice for 
most subjects. 

The therapist attempted to respond whenever a 
subject paused at the end of a sentence or group of 
sentences concerning one topic. The chief technique 
used was a reflection of the subject’s communication: 
the therapist attempted to mirror, in her own words, 
some aspect of what the subject had said. This type 
of responding, in both F and FC groups, is similar 
to what has been called “reflection of feeling” in 
the client centered literature (Bergman, 1951). In 
the FC group, the therapist’s response included 
content as well as feelings, while in the F group 
only feelings were reflected. Although the thera- 
pist’s reflections were largely based on the subjects’ 
explicit verbal statements, she also responded to 
feelings indirectly expressed through voice tone and 
through facial and bodily movements. The therapist’s 
responses to subjects in the C condition were similar 
to what has been called “restatement of content” 
(Snyder, 1945). 

Following is an illustration of the way in which 
the therapist might have responded to the same 
subject communication in each of the three con- 
ditions. 


Subject: Sometimes my father yells at my 
mother—over just about nothing. But, boy, can 
he get mad. And I don’t like it one bit when he 
does that. 

Therapist: 

F: You really get upset about that. 

FC: You feel real upset when your father gets 
mad at your mother about things that don’t matter. 

C: Sometimes your father gets real mad and 
yells at your mother about little things. 


Aside from the manipulation of the variable being 
studied, the therapist attempted to keep conditions 
as similar as possible for all three experimental 
groups. Since examination of early pilot interviews 
had suggested that the therapist sounded less in- 
terested and less warm in the C condition than in 
the others, she continued pilot work until it ap- 
peared that this variable was held constant. The 
therapist also attempted to maintain a constant 
frequency of responding in the three conditions, and 
an analysis of variance of the number of responses 
in each interview revealed no significant differences 
among the three groups. According to the therapist’s 
retrospective account, the FC condition seemed most 
similar to psychotherapy and was quite easily im- 
plemented. Responses in the C condition often 
seemed superficial and those in the F condition 
sometimes seemed to distort the subject’s meaning 
due to the separation of feelings from the context 
in which they appeared. 

At the end of the fourth interview, subjects were 
given a questionnaire, which they were asked to 
return to the mailbox of “another psychologist in- 
terested in subjects’ reactions to the experiment.” 
The major purpose of the questionnaire was to 


assess, in various ways, the subjects’ awareness of the 
selective responding by the therapist. Subjects were 
asked, eg., which of several areas they thought 
should be stressed in therapy, what areas had 
actually been stressed in their sessions, etc. 

Finally, the experimenter held a 5-minute interview 
with each subject, in which the subject discussed his 
hypotheses about the purpose of the experiment. 


Rating of Interview Material 


Ratings of the subjects’ responses were made 
directly from the tape recordings. A subject re- 
sponse was defined as any communication which 
was surrounded by two therapist responses. Brief 
interruptions during a therapist’s response were not 
rated. A list of rules for classifying responses, which 
had been developed in pilot work, was used for 
guiding decisions in difficult instances. Each response 
was classified as predominantly F, FC, or C or, 
when necessary, one half-credit was given to F and 
FC or to C and FC. These ratings sometimes de- 
pended on more than the simple presence or absence 
of words denoting feeling, since tone of voice was 
also taken into consideration. No inferences were 
made, however, about underlying emotional states, 
and interpretation of results must be limited to the 
conditioning of explicit expressions of feeling. 

The therapist rated all interviews, listening to an 
entire tape and classifying each response as it was 
completed. Parts of an interview were replayed only 
if they could not be heard clearly or if listening was 
interrupted. A second rater, an advanced graduate 
student in clinical psychology, rated 24 of the inter- 
views for an estimate of interrater reliability. The 
24 interviews were randomly selected except for the 
restrictions that they include two each of the first, 
second, third, and fourth interviews in each of the 
three conditions and that there be not more than 
one interview per subject. As a check on the objec- 
tivity of the ratings, the second rater heard only the 
subjects’ statements and was not aware of the ex- 
perimental condition in which a subject was placed. 
The interrater reliability of the F-FC-C profiles for 
the 24 interviews, as estimated by Mosier’s formula 
(Guilford, 1954), was .81. 

For testing the consistency of the therapist’s re- 
sponding in the three conditions, 36 10-minute seg- 
ments were extracted from the interviews. These in 
cluded one segment of an interview with each of the 
36 subjects, and each condition was represented by 
a segment from the first, second, and third 10-minute 
periods of each of the four interviews. The thera- 
pist’s responses were categorized by a graduate stu- 
dent, who had previously practiced making such 
categorizations with the therapist, as focusing pre- 
dominantly on feeling (F), content (C), or feeling 
combined with content (FC), and the percentages 
of F, FC, and C responses were computed for each 
segment. For the entire sample of 12 segments in the 
F condition, the therapist made 77% F, FC, 
and 1% C responses. In the FC condition, she made 
84% FC, 14% F, and 2% C responses, and in the C 
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TABLE 1 


ANALYSIS OF PERCENTAGE OF RESPONSES IN F, FC, AND C CATEGORIES 
For F, FC, anp C Groups OVER INTERVIEWS 


Between means 
Categories 
Groups X categories 

Between individual means 

Between linear trends 

Categories 

Groups X categories 


Between individual linear trends 
Between quadratic trends 
Categories 
Groups X categories 
Between individual quadratic trends 
Between cubic trends 
Categories 
Groups X categories 
Between individual cubic trends 


Total 


222.52° 
11.95* 


2 0285 1.86 

4 .0296 1.93 
66 .0153 

2 0313 3.19 

4 .0077 — 
66 .0098 

2 .0023 -- 

4 .0222 2.06 
64> .0108 


286° 


* No differences possible among treatment groups, since F, FC, and C scores must sum to 100% in all interviews for all groups. 
b Degrees of freedom reduced by two, because missing values estimated for one interview (Cochran & Cox, 1950). 
© Requirement that three scores sum to 100% results in loss of one degree of freedom for each interview, thus total df reduced 
by one-third. 
* Significant at .01 level. 


condition, 88% C and 12% FC responses. In 34 of 
the 36 samples, the therapist responded most fre- 
quently with the response called for by the experi- 
mental condition. In one FC sample, she responded 
predominantly with F responses, and in one F sam- 
ple, with FC responses. 


RESULTS 


In order to test for changes in the F-FC- 
C profiles of subjects over interviews, the 
analysis of variance applied was a trend test 
(Grant, 1956) analyzed so as to reveal dif- 
ferences in profiles (Block, Levine, & Mc- 
Nemar, 1951). Since the data departed to 
some degree from the distribution character- 
istics of normality and homogeneity of vari- 
ance, a probability level of .01 was stipu- 
lated for accepting a result as significant. The 
results of the analysis are presented in Table 1 
and the mean percentages for the three groups 
over interviews are plotted in Figure 1. 

The significant interaction between cate- 
gories and groups tends to support the major 
hypothesis: the subjects in the different ex- 
perimental conditions did produce different 


patterns of responses in the F, FC, and C 
categories. Three Duncan range tests (Dun- 
can, 1955) were used to compare the means 
for the groups in the three categories. Al- 
though the groups did not differ significantly 
in the percentage of F responses they made, 
Group C made a significantly higher percent- 
age of C responses and a significantly lower 
percentage of FC responses than did either 
of the other groups. There were no signifi- 
cant differences between groups FC and F 
and no significant trends over interviews. 
Thus, although Group C had a greater rela- 
tive amount of C throughout the experiment, 
there were no consistent changes in the per- 
centages in different categories over the four 
interviews. The only other significant finding 
was the higher overall percentage of FC re- 
sponses. Even in Group C, FC responses con- 
tinued throughout the interviews to consti- 
tute more than 50% of the total responses. 
Since differences in the percentage of C 
responses appeared as early as the first in- 
terview, an analysis was performed to see 
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whether an increase in C responses in the C 
group could be found within the first inter- 
view. The interview was divided into sixths 
according to the number of responses made 
and a trend analysis identical to the one cited 
above was performed. The results of this 
analysis are presented in Table 2 and the 
mean percentages of F, FC, and C responses 
for sixths of the first interview are plotted in 
Figure 2. Within the first interview, signifi- 
cant results were again found for the differ- 
ence between categories and for the category- 
group interaction. In addition, the compo- 
nent of the difference between quadratic 
trends which may be attributed to the group- 
category interaction was found to be signifi- 
cant. Thus, although the C responses made 
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Fic. 1. Mean percentages of responses in cate- 
gories F, FC, and C for three groups over inter- 
views. 
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Fic. 2. Mean percentages of responses in categories 
F, FC, and C for the three groups during the first 
interview. 


by the C group did not increase in a linear 
fashion, they did show an increase during the 
first part of the first interview, with a re- 
ciprocal decrease in FC responses, and it is 
the initial increase which seems responsible 
for the higher level of C responses found in 
the first interview for the C group. Examina- 
tion of the graphs plotting sixths of the other 
interviews suggests that the percentage of C 
responses for the three groups remains fairly 
stable within each of the interviews following 
the first one. 

There appeared to be no general awareness 
of the material being reinforced. Subjects in 
the three groups did not differ significantly 
(Friedman’s test in Siegel, 1956), in the ex- 
tent to which they believed different types of 
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TABLE 2 


ANALYSIS OF PERCENTAGE OF RESPONSES IN F, FC, AND C CATEGORIES FOR 


F, FC, anp C Groups OvER SIxXTHS OF THE First INTERVIEW 


Source of variation® 


df MS 


Between means 
Categories 
Groups X categories 


Between individual means 


Between linear trends 
Categories 
Groups X categories 


Between individual linear trends 


Between quadratic trends 
Categories 
Groups X categories 


Between individual quadratic trends 
Between cubic trends 

Categories 

Groups X categories 
Between individual cubic trends 
Residual 


Total 


16.3892 


0231 
0223 


0445 


.0377 
.0259 


0355 

.0262 
1 


432 


*No differences possible among treatment groups, since F, FC, and C scores must sum to 100% in all interviews for all groups. 
> Requirement that three scores sum to 100% results in loss of one degree of freedom for each interview, thus total df reduced 


by one-third. 
* Significant at .01 level. 


material had been stressed in the interviews. 
Also, no subject was able, when questioned at 
the conclusion of the experiment and after ad- 
ministration of the questionnaire, to state cor- 
rectly the purpose of the experiment or even 
to approach an adequate explanation. 


DIscussIoNn 


The major hypothesis of this study was 
partially supported. Selective responding to 
the content aspects of subjects’ communica- 
tions did result in a higher percentage of con- 
tent responses. An increase in C responses for 
the C group took place within the first third 
of the first interview and the higher level was 
maintained throughout the rest of the inter- 
views. Thus, reflection of content did func- 
tion as a positive reinforcer. This finding 
demonstrates the occurrence of verbal condi- 
tioning in a situation closely resembling psy- 
chotherapy, in that the experiment was de- 
scribed to subjects as a therapy-like experi- 
ence and the response class reinforced and the 


class of reinforcing stimuli were relevant to 
psychotherapy. 

The successful reinforcement of content re- 
sponses within the first interview leads to an 
interesting speculation: may such reinforce- 
ment early in therapy by some information- 
oriented therapists cause difficulties in their 
later attempts to focus on patients’ feelings? 
To help answer this question, it would be 
necessary to obtain extinction as well as 
learning data, preferably comparing extinc- 
tion of C responses through nonreinforce- 
ment and through reinforcement of F or FC 
responses. 

Although the C group showed successful 
conditioning, selective responding to feelings 
(F group) and to a combination of feelings 
and content (FC group) did not function as 
reinforcers. For the FC group, examination 
of the data of the second and third inter- 
views suggests that possibly a slower and less 
striking conditioning occurred than for the C 
group, an effect not large enough to reach 
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statistical significance. Since all groups had 
an initially high level of FC responses (be- 
tween 60 and 70%), there may not have been 
much room for a significant increase in these 
responses to occur. Although the high level 
of FC responses throughout may seem to in- 
dicate willingness of subjects to deal with 
emotional material, the FC responses made 
were largely concerned with feelings about 
relatively conflict-free, conversational subject 
matter. The FC responses so clearly domi- 
nated the interviews that, even in the suc- 
cessfully conditioned C group, the increased 
percentage of C responses did not exceed the 
percentage of FC responses. 

The F responses, in contrast to the FC re- 
sponses, frequently dealt with personal and 
troubling feelings. Although the percentage of 
F responses was relatively low at the begin- 
ning of the experiment, thus allowing consid- 
erable room for increase, selective responding 
to feelings did not result in the predicted in- 
crease in these responses. Even those subjects 
with a relatively high initial level of expres- 
sion of feelings were not successfully condi- 
tioned. In the only other study in which re- 
flections of feeling were used in an attempt 
to reinforce expressions of feeling (Levine, 
1958), there was a similar lack of reinforc- 
ing effects. Since Levine also failed to condi- 
tion expressions of feeling with the use of 
an “umhmm” reinforcer, one might suspect 
a difficulty inherent in the class of responses 
being reinforced. Salzinger and Pisoni (1958, 
1960), however, present contradictory evi- 
dence: they have successfully reinforced state- 
ments of affect by the use of such reinforcing 
stimuli as “umhmm,” “yeah,” and “I see.” 
Differences between the Salzinger and Pisoni 
studies and the Levine “umhmm” condition 
will not be discussed here, since we are pri- 
marily concerned with the disparity between 
our results and those of Salzinger and Pisoni. 
There are several ways in which the present 
experiment differs from those of Salzinger 
and Pisoni. The responses reinforced in the 
Salzinger and Pisoni studies were defined as 
all statements beginning with “I” and “we” 
followed by an expression of affect. Thus, 
since the class of responses reinforced as well 
as the class of reinforcing stimuli was less 
complex and more clearly defined than that 


in the present study, the contingency between 
the reinforcing stimulus and the reinforced 
responses may have appeared more clearly 
and may, therefore, have been more effective. 
The response-reinforcement contingency in 
the present setting was, however, sufficiently 
clear for the reinforcement of C responses to 
occur.* 

A more important difference between this 
study and the Salzinger and Pisoni experi- 
ments may lie in the nature of the reinforc- 
ing stimulus. The utterance of “umhmm” or 
“yeah” may have simple positive connota- 
tions for a subject and may thus result in an 
automatic reinforcement of the preceding sub- 
ject verbalizations. The reflection of a feel- 
ing, however, may cause a subject to look at 
his feeling and to realize that another person 
is also aware of it. Within the context of the 
present experimental situation, this procedure 
may have been somewhat unpleasant and 
anxiety arousing, especially since the student 
volunteers seemed to prefer chatting to deal- 
ing with troubling emotions. Thus, aversive 
qualities of the reflection of feeling may have 
interfered with its proposed function as a 
positive reinforcer. 

Under different conditions, reflection of 
feelings might function as a reinforcer of ex- 
pressions of feelings. Recent studies (Mandler 
& Kaplan, 1956; Sapolsky, 1960; Sarason, 
1958) have demonstrated the effects on ver- 
bal conditioning of such variables as subject 
personality, characteristics of the subject-ex- 
perimenter interaction, and subjects’ inter- 
pretation of the reinforcing stimulus. Ma- 
nipulation of such variables within the con- 
text of the present study might lead to 
results different from those obtained. For ex- 
ample, individuals more similar to actual psy- 
chotherapy patients may have higher levels of 
expressing troubling feelings and more mo- 
tivation to deal with them; they might, there- 
fore, interpret the reflection of feelings as a 
positive stimulus. Also, as is probably the case 
in therapy, the therapist may first have to 

*It is possible that the F condition differed from 
both the C condition and from Salzinger and Pisoni’s 
procedure in having a longer latency of the thera- 
pist’s responses. Although the therapist’s impression 
is that the latencies were fairly constant in the 


different conditions, the recordings are no longer 
available for study of this variable. 
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establish himself as a generally positive figure 
before he can successfully reinforce expres- 
sions of feeling. It is interesting to note here 
that Salzinger and Pisoni did have a 10- 
minute period of introductory questions in 
which the therapist reinforced everything the 
patient said. 

In the present setting, selective responding 
to feelings did not function as a positive re- 
inforcer. Thus, conditioning of subjects’ ver- 
bal behavior in a therapy-like setting was not 
as easily achieved as one might predict on 
the basis of the verbal conditioning litera- 
ture. In order to justify generalizations to 
psychotherapy from the learning literature, 
the complex variables which may influence 
learning in the therapy situation must be 
recognized and studied. 

The finding that subjects were not aware 
of the selective responding to certain aspects 
of their communications is in agreement with 
the frequent reports of lack of awareness in 
the verbal conditioning literature (see Kras- 
ner, 1958). The subjects in the present study 
seemed, simply to accept the therapist’s con- 
duct as prescribed by a particular type of 
psychotherapy and were chiefly concerned 
with what they could find to talk about. 


SUMMARY 


This study tested the hypothesis that se- 
lective responding by a therapist would func- 
tion as a reinforcer of content or feeling as- 
pects of a subject’s communications. Three 
experimental groups were utilized, one in 
which the therapist reflected feeling aspects 
of the subjects’ communications (Group F), 
one in which she reflected a combination of 
feelings and specific content (Group FC), 
and a third in which she reflected content 
only (Group C). 

The 36 student subjects had volunteered to 
participate in a therapy-like experiment. Trip- 
lets of subjects were matched for level of ex- 
pressing feeling and content in a preliminary 
interview and were then randomly assigned 
to the three groups. Each subject was seen 
for four 30-minute experimental interviews 
spaced over a period of 2 weeks. An analysis 
of the therapist’s responses indicated that the 
experimental conditions in the three groups 
had been established reliably. 


. WAskow 


The subjects’ responses were categorized as 
consisting predominantly of feeling elements 
(F), content (C), or a combination (FC). 
An analysis of the ratings revealed a signifi- 
cant difference in the F-FC-C profiles for the 
three groups, due to a higher percentage of 
C responses, and a correspondingly lower per- 
centage of FC responses, for the C group. 
No significant trends were found over inter- 
views. In a separate analysis of the first in- 
terview, a significant difference was found in 
quadratic trends over sixths of the interview 
for the group-category interaction. An initial 
increase in the C responses for the C group 
appeared responsible for the overall differ- 
ences found. Thus the C group was success- 
fully reinforced and the major effects of the 
reinforcement could be seen within the first 
third of the first interview. No significant dif- 
ferences were found between the F and FC 
groups and they showed no clear effects of 
reinforcement. 


The questionnaire results and discussion 
with the subjects following the experiment 
suggested a lack of awareness by the subjects 
of the selective responding by the therapist. 

The results for the content group consti- 


tute an extension of the verbal conditioning 
literature by demonstrating successful condi- 
tioning in a situation closely resembling psy- 
chotherapy. The lack of reinforcement in the 
F group suggests the presence in this setting 
of aversive properties in the reflection of feel- 
ings. The complex variables of the therapy 
situation must be considered and approached 
experimentally before generalizations are 
made from the verbal conditioning literature 
to psychotherapy. 
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FACTOR ANALYSIS OF RORSCHACH SCORING 
CATEGORIES FOR A POPULATION 
OF NORMAL SUBJECTS 


ROBERT H. GEERTSMA 


Neuropsychiatric Institute, University of California, Los Angeles 


The use of a normal rather than a patient 
population of subjects for factor analytic 
studies of the Rorschach offers the oppor- 
tunity to check hypotheses about dimensions 
of the test without complicating the analysis 
by the possible effects of emotional disturb- 
ance. Despite this significant advantage, only 
two factor analytic studies to date have been 
based on normal populations. Their results 
and conclusions, although not entirely con- 
sistent, are of considerable interest. Witten- 
born (1950) studied the Rorschach responses 
of 92 Yale undergraduates. Using as variables 
20 response categories representing the main 
scoring categories of the Klopfer system, he 
extracted four centroid factors from his cor- 
relation matrix. After rotating these factors 
to an orthogonal simple structure solution he 
concluded: 


1. The factorial composition of M differs 
from that of CF and C. The factorial com- 
position of CF and C is similar. 

2. The factorial composition of FC is dif- 
ferent from that of CF and C. It is more simi- 
lar to M than to CF and C. 

3. There is a factor loading M (and vari- 
ous detail categories) which is different from 
factors loading W, CF, C, texture, and dif- 
fusion. 

4. The various scoring categories correlate 
with R to produce a general productivity fac- 
tor; this factor accounts for differences in the 
common factor variance among the scoring 
categories. 

5. The various subclasses of scoring cate- 
gories (e.g., FC, CF, and C) within the 
broader classes (e.g., color, texture, shading) 
have dissimilar factorial compositions which 
could not be predicted from current inter- 
pretative principles. 


In a recent study, Wishner (1959) pre- 
sented a factor analysis of Rorschach scoring 


20 


categories taken from data gathered by Beck, 
Rabin, Thiesen, Molish, and Thetford (1950) 
from a stratified population of 157 normal 
subjects. Wishner advanced the following hy- 
potheses to guide his investigation: 


1. A factor of psychological efficiency can 
be found. 

2. The factorial composition of FC is dif- 
ferent from that of CF and C. 

3. A factor of verbal productivity can be 
found. 


From his 20 variable (representing Beck’s 
main scoring categories) correlation matrix 
Wishner extracted four factors by the centroid 
method. After failing to rotate these factors 
to a satisfactory simple structure solution, he 
used his hypotheses and the data supporting 
them to guide his placement of hyperplanes 
in the rotational process. After evaluating the 
degree of agreement between his final rota- 
tional solution and his original hypotheses, 
he reported confirmation of Hypothesis 1, 
disconfirmation of Hypothesis 2, and support 
for Hypothesis 3. Wishner’s results failed to 
confirm Wittenborn’s finding that the fac- 
torial composition of FC differs from that of 
the other color categories, C and CF. 

The present analysis was undertaken in an 
effort to realize Wishner’s original plan of 
rotating factors derived from his correlation 
matrix to an acceptable simple structure solu- 
tion. In order to eliminate the possibility that 
Wishner’s failure to achieve a simple struc- 
ture solution could be due to the underextrac- 
tion of factors, his correlation matrix was re- 
factored by the principal components method 
and seven first-order factors extracted. The 


1 Following the suggestion of Kaiser (1960), factor 
extraction was continued until the latent roots be- 
came less than 1.00. The seventh factor has a root 
less than 1.00 but was included in the rotational 
work in order to insure a sufficiency of factors for 
an adequate rotational solution. 
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Factor ANALYSIS OF RORSCHACH 


TABLE 1 


ScorRING CATEGORIES, UNROTATED AND ROTATED Factor LOADINGS 


Unrotated factor loadings 


Rotated factor loadings 


Scoring categories 


R —03 


W% 

D% 

Dd% 

M% 

(C +CF)% 

(C + CF + FOC)% 
FC% 

Sum C/R 

(Y + YF +FY)% 
(V + VF +FV)% 
(All Vs + ¥s)% 
F% 


Note.—Decimal points are omitted. 


use of more than four factors was recom- 
mended by our interest in identifying as many 
as feasible of the independent dimensions un- 
derlying the scoring categories used as vari- 
ables. Most clinicians would probably expect 
these categories to represent more than four 
dimensions. The seven factors were then ro- 
tated analytically by Carroll’s (1953; Kaiser, 
1958) oblimin method to an oblique simple 
structure solution.* Using the same computa- 
tional procedures, two second-order factors 
were obtained and rotated toward simple 
structure. 


RESULTS 


The findings of this study are based on 
the correlation matrix presented by Wishner 
(1959, Table 2). 

The data collection procedures and the 
population of normal subjects are described 
by Beck et al. (1950), and by Wishner 
(1959). Of the 157 subjects, 71 were males 
and 86 were females. Their ages ranged from 
17 to 69, and their occupations were divided 
among executives, junior executives, skilled 

2 Thanks are due to John Carroll for providing his 
IBM 704 Oblimin rotational program, and to the 
Western Data Processing Center, U.C.L.A. Graduate 


School of Business Administration, for use of their 
computer facilities. 
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workers, 
workers. 

Table 1 contains the scoring categories, 
original factor loadings, and rotated first- 
order factor loadings (reported in the refer- 
ence vector system). The analytically rotated 
factor loadings can be seen to represent a 
compelling simple structure solution, with the 
possible exception of the first factor. Although 
further rotation of Factor 1 seems desirable 
to place more loadings in the near-zero 
range, this was not attempted because the 
interpretation of this factor proved to be clear 
as it stands. Table 2 presents the correla- 
tions between the primary factors. The cal- 
culation of second-order factors, given in 
Table 3, was undertaken in order to illumi- 
nate the major dimensions underlying the 
scoring categories. The above-stated hypothe- 


semiskilled workers, and unskilled 


TABLE 2 


CORRELATIONS BETWEEN PRIMARY FACTORS 


Factors 
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TABLE 3 


SECOND-ORDER Factor LOADINGS 


Rotated 
loadings 
Factor 


S1 S2 


. Color 

. Perceptual accuracy 
. Human movement 

. Shading 

. Organizational ability 
. Vista 

. Restricted content 


79 
—02 
31 
72 
31 
28 


Note.—Decimal points are omitted. 


ses about psychological efficiency and verbal 
productivity factors might possibly be con- 
firmed in the second-order rather than the 
first-order realm. 


INTERPRETATION OF THE FACTORS 


Let us next take up the meaning of the 
seven first-order factors in terms of the vari- 
ables loading high on them. For this purpose 
all loadings outside of the limits +.31 (i.e., 
10% of the variance) will be considered to 


be significant. 

Factor 1. This factor has high positive 
loadings on Sum C/R, (C + CF)%, and (C 
+ CF + FC)%, Z/R, and FC%, and nega- 
tive loadings on F%, Dd%, and T/R. This 
appears to be a color factor and to include 
the FC as well as C and CF categories. The 
significant Z/R loading suggests that this 
color factor is associated with combinational 
ability, an association which implies that 
some flexibility and intellectual skills are 
represented by C and CF responses. We can 
interpret the negatively loaded F% and Dd% 
variables as due to the circumstance that 
color responses cannot be F responses and 
are not apt to be Dd responses in normals. 

Factor 2. The second factor has high posi- 
tive loadings on P%, F + %, and A%, and 
negative loadings on R and Dd%. This is 
clearly a perceptual accuracy factor. The 
negative loadings on R and Dd suggest that 
high perceptual accuracy is associated in nor- 
mals with a restriction on R and Dd. 

Factor 3. This factor loads high on M% 
and H% positively, and F% and A% nega- 
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tively. This is a human movement factor. The 
negative loadings can be attributed to the 
mutual exclusiveness of M and H with re- 
gard to F and A in the scoring system. 

Factor 4. The high loadings on this factor 
are (Y + YF + FY)%, (all Ys and Vs)%, 
FC% positively, and F% negatively. This is 
clearly a shading factor, and one which is 
loaded FC but not C and CF. The high load- 
ing on (all Ys and Vs)% can be attributed 
to the influence of the Y responses in this 
variable. 

Factor 5. Positive high loadings on this 
factor are W%, Z/R, and the negative load- 
ing is D%. This is a factor or organizational 
ability. Intelligence is also probably related 
to this factor, but in the absence of evidence 
of its correlation with an external criterion 
of intelligence, we will call it an organiza- 
tional ability factor. 

Factor 6. This factor has high loadings on 
(V+VF + FV)% and (all Ys and Vs)% 
positively, and FC% negatively. Factor 6 
may be termed a vista factor. The high load- 
ings on (all Ys and Vs)% can be attributed 
to the vista responses included in the scoring 
of this variable. 

Factor 7. The seventh factor has a nega- 
tive loading on N/R for its highest loading. 
A% is the high positive loading. This factor 
seems to refer to a restricted range of content. 

The two second-order factors appear to 
represent dimensions of cognitive and affec- 
tive responsivity, respectively. The cognitive 
dimension, factor $1, is bipolar. Drawing 
from current interpretative principles, we 
speculate that on one extreme the high load- 
ings for perceptual accuracy and organiza- 
tional ability suggest an external focus of 
cognitive activity, whereas the negative high 
loadings on vista, human movement, and re- 
stricted-content factors identify the opposite 
pole with internally focused cognitive activ- 
ity. The S2 factor with its major loadings for 
color and shading clearly represents the di- 
mension of affective experience. 


Discussion 


Looking at the /#? column of Table 1, we 
note that the seven first-order factors ac- 
count for at least 50% of the variance for 
each of the 20 variables. The studies of Wit- 
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tenborn and Wishner, in contrast, both re- 
ported that seven of their 20 variables did not 
contribute as much as 50% of their variance 
to the results. Thus, the present findings can 
be considered well grounded in the common 
factor variance of the variables under analysis. 

It is most striking that the basic scoring 
categories represented in the 20 variables 
show such minimal overlapping and _ inter- 
mixing in the structure of the first-order fac- 
tors. The identification of color, human move- 
ment, shading, organizational ability, vista, 
and perceptual accuracy factors which are 
only slightly correlated provides extremely 
cogent support for the separate scoring of 
these variables. 

The second-order factors uphold the fa- 
miliar distinction between affective and cog- 
nitive dimensions in the Rorschach test. And 
yet, these dimensions overlap just as we ex- 
pect affect and cognition to interact in the 
functioning personality. The shading and 
color factors have only negligible loadings on 
the cognitive factor, but the human move- 
ment, organizational ability, and vista fac- 
tors all contribute some variance to the affec- 
tive dimension. The implication is that M, Z, 
and V all involve some affective responsivity 
secondary to their cognitive meaning but that 
this responsivity is much weaker than that 
represented by the color and shading re- 
sponses. In this context, the finding that per- 
ceptual accuracy has no comparable loading 
on the affective factor suggests that percep- 
tual accuracy is the variable most antithetical 
to affective responsivity. 

The structure of the second-order factors 
suggests that the vista variable is more closely 
related to a cognitive process than to affect 
or a feeling experience, as represented by 
shading or color responses. The second-order 
loadings (Table 3) for the vista factor (Fac- 
tor 6) closely parallel the loadings for the 
human movement factor (Factor 3), and we 
interpret this as indicating: (a) the cognitive 
activity of vista responses, and (5) the likeli- 
hood that vista responses, like human move- 
ment responses, involve or are associated with 
internal representations of people. The latter 
point fits well the interpretation of vista re- 
sponses as self-appraisal. Rorschach test users 
may take these findings to indicate that dys- 


phoric affect is not necessarily implied by 
vista responses, and that some sensitivity to 
the self and others as objects of thought is 
implied by the vista response. 

An important general question can be an- 
swered from the results of the second-order 
factors, namely the question of the funda- 
mental dimensions of Rorschach’s test as rep- 
resented in the present variables. Four clus- 
ters of variables are distinguishable, Cs and 
Ys, F+ and P, M and V, W and Z. The 
other variables, R, FC, F, range of content, 
and 7/R, etc. all relate to these main lines 
of the personality configuration after the 
fashion indicated by the first-order factors. 

Let us now consider Wishner’s three hy- 
potheses. 

1. The hypothesis of a psychological effi- 
ciency dimension proposes a factor represent- 
ing a continuum running from flexibility of 
response at one pole to rigidity of response 
at the other. Wishner regarded F% as most 
representative of the rigidity pole because of 
its correlation with general alertness (Wish- 
ner, 1953). He expected T/R and A% also to 
represent the rigidity pole and M%, C, Y, 
and V to define the opposite pole, flexibility. 
Factor 1 fits part of this expectation very 
closely, for we find the color determinants 
and Z/R at the positive pole (i.e., spontane- 
ous emotional responsivity and the ability to 
educe relationships) and F%, Dd%, and T/R 
(i.e., attention to form, attention to small de- 
tails, and deliberateness) on the negative side. 
Factor 1, then, can be interpreted as a bipolar 
factor representing a continuum of flexibility 
—trigidity. We should, however, also note that 
the F% variable loads negatively on two 
other factors, thus suggesting a possible flexi- 
bility—rigidity dimension in regard to hu- 
man movement (Factor 3) and shading (Fac- 
tor 4) as well as for Factor 1. It is in this 
broad sense that the present results support 
Wishner’s first hypothesis; the color, shading, 
and human movement categories can each be 
viewed as one pole of a continuum which has 
attention to form as the opposite dimension. 
In the present findings, the F% variable 
serves more as a contrasting effect within the 
various factors than as representing a basic 
dimension in either the first or second-order 
factor space. 
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2. The factorial composition of FC% in the 
present results is both different from and more 
complex than that of (C + CF)%. Yet it is 
not completely different from (C + CF)%. 
Although FC% has a relatively low but sig- 
nificant loading on the (C+ CF)% factor 
(Factor 1), it also loads just as highly on 
two other factors, the shading factor (Factor 
4) positively, and the vista factor (Factor 6) 
negatively. The high loading of the FC vari- 
able on the shading factor indicates that the 
affective experience of FC contains, along 
with the color component, an equal measure 
of the shading response. Interpretatively, we 
may speculate that it is the shading com- 
ponent that modulates, inhibits, opposes, or 
controls the experience of the color responses. 
In any case, Rorschach workers have for 
many years thought of FC as being different 
from C and CF responses, at least in the 
sense that FC is much further from CF than 
CF is from C. Although the factorial com- 
position of FC% supports Wittenborn’s con- 
clusion about differing subgroup factorial 
compositions existing within a class of scor- 
ing determinants, it cannot be said that this 
finding is not consistent with current inter- 
pretations of the meaning of the FC response. 

3. The verbal productivity hypothesis posits 
a factor with high loadings on Dd%, M% 
and (C + CF)%. The present analysis, how- 
ever, gives the M% and (C + CF)% vari- 
ables separate factorial status, so this hy- 
pothesis is not confirmed in its explicitly 
stated form for either the first or second-or- 
der factors. Nevertheless, productivity seems 
involved in the perceptual accuracy factor 
(Factor 2) which is loaded negatively by 
Dd% and R. This factor suggests that per- 
ceptual accuracy is contrasted with a great 
number of responses. In the second-order 
realm, the positive pole of S1 is also asso- 
ciated with perceptual accuracy. Also related 
to Wishner’s third hypothesis is the factor 
we have called organizational ability (Factor 
5), and we note that this factor, like Factor 
2, is represented on the positive pole of S1. 
Factor 5 might be interpreted as an intelli- 
gence factor such as Williams and Lawrence 
(1953) reported for a population of psychi- 
atric patients. Their intelligence factor loaded 
W, M, F, FC, and R as well as the Wechsler- 
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Bellevue Verbal and Performance IQ scores. 
However, Factor 5, with its W responses, or- 
ganizational ability, and deliberateness ap- 
pears more clearly defined either than the 
intelligence factor of Williams and Lawrence 
or than is implied by Wishner’s statement of 
his third hypothesis. 

Despite the above mentioned points of 
relevance, the present findings do not fit well 
into Wishner’s explicit expectations. The sepa- 
rate status of the basic scoring categories 
found here militates against the emergence 
of a verbal productivity or a psychologi- 
cal efficiency factor. Wishner’s Hypotheses 
1 and 3 assume that the various C, M, Y, 
and V variables which should load on these 
factors are not themselves independent di- 
mensions, and this assumption seems to be 
supported by his previous work (Wishner, 
1953). Methodological differences were per- 
haps most significant in producing the con- 
trast between Wishner’s work and that re- 
ported here. The use of a relatively large 
number of factors distinguishes this study 
most clearly from Wishner’s procedure. If 
only four factors had been used in this study, 
then, because some seven semiindependent 
dimensions are represented in the data, one 
would expect both a difficulty in rotating to 
a simple structure solution and a mixing of 
the seven dimensions to produce factors that 
combine these dimensions. The use of seven 
factors for this analysis seems to have avoided 
these problems and to have produced a dif- 
ferent character of findings. 


SUMMARY 


In factor analyzing the Rorschach scores 
of 157 normal subjects whose protocols were 
collected by Beck et al. (1950), Wishner 
(1959) hypothesized that: (a) There is a 
psychological efficiency factor; (6) The fac- 
torial composition of FC is different from 
that of C and CF combined; and (c) There 
is a verbal productivity factor. After first 
failing to rotate his four factors to an ac- 
ceptable simple structure solution, he used 
his hypotheses to guide his factor rotation 
and to provide criteria of evaluation. In this 
fashion, he was able to report confirmation 
of the first hypothesis, no confirmation of the 
second, and some support for the third. 
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The present study re:actored Wishner’s 20 
variable correlation matrix, extracting seven 
first-order factors. These were analytically 
rotated to a simple structure solution and 
were interpreted as: (a) a color factor, (5) 
a perceptual accuracy factor, (c) a human 
movement factor, (d) a shading factor, (e) 
an organizational ability factor, (f) a vista 
factor, and (g) a restricted range of content 
factor. 

The correlations between the above factors 
were then factor analyzed to produce two 
second-order factors which were also ana- 
lytically rotated to a simple structure solu- 
tion. These factors were interpreted as: (S1) 
a cognitive factor, and (S2) an affective fac- 
tor. The close parallel between the first-order 
factors and the Rorschach scoring categories 
was discussed. The FC% variable’s loading 
more significantly on the shading factor than 
on the color factor was noted, and the con- 
clusion was drawn that the vista variable im- 
plies internal cognitive activity akin to that 
represented by the human movement variable. 

The relationships between the above fac- 
tors and Wishner’s hypotheses were discussed, 
and it was concluded that the present re- 
sults provide partial support for his first two 
hypotheses. It was suggested that the differ- 


ences between this analysis and Wishner’s 
study could be primarily due to the number 
of factors extracted and rotated. 
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THE CONTROL OF SOCIAL DESIRABILITY: 


A COMPARISON OF THREE DIFFERENT Q SORTS AND A 
CHECK LIST, ALL COMPOSED OF THE SAME ITEMS 


WILLIAM S. KOGAN 
Veterans Administration Hospital, Seattle, Washington 
anp WILBERT E. FORDYCE 
University of Washington, School of Medicine 


In the field of personality inventories con- 
siderable energy in recent research has been 
directed toward the problem of isolating and 
eliminating sources of variation that are con- 
founded with the variables an inventory is 
supposed to measure. This variance has usu- 
ally been considered distorting and capri- 
cious in its effect unless removed. The distort- 
ing variance has at various times been called 
test-taking attitude, response set (Cronbach, 
1946), etc. One illustration of these measures 
of unwanted variances is the F, K, L scales 
of the MMPI (Edwards, 1957a, 1957b; 
Meehl & Hathaway, 1946). The measures 
obtained are used as a “correction” for the 
other obtained scores. Edwards (1953b), de- 
fining the principal source of the unwanted 
variation as social ‘desirability (SD), hy- 
pothesized and demonstrated a stable rela- 
tionship between median social desirability 
(MSD) value of an item and the probability 
of its endorsement. He devised an inventory, 
Personal Preference’ Schedule (1953a), whose 
design attempted to resolve the effect of SD 
by minimizing it as a source of variation. 

Concurrently with the delineation of some 
characteristics of these unwanted variances 
and methods for controlling them, there was 
increasing interest in the possibility that they 
might have value and meaning for person- 
ality assessment. For example, F, K, and SD 
were found to be highly related among them- 
selves and with several of the clinical scales 
of the MMPI (Fordyce, 1956). In another 
study SD was shown to be related to a sick- 
ness-health continuum (Kogan, Quinn, Ax, & 
Ripley, 1957). Therefore, it seemed impor- 
tant to unconfound but retain for measure- 
ment rather than remove or minimize the 
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variance of SD. A similar position is expressed 
by Broen and Wirt (1958). 

This study was designed to compare four 
rating scale forms' with respect to two gen- 
eral questions: (a) their equivalence, i.e., 
correlations between corresponding variables 
on the different forms, and (5), the differ- 
ential effectiveness of different ways of con- 
trolling for SD. 

Within the question of equivalence a spe- 
cific question is, do a Q sort and a check list 
give comparable results. Block (1957) has 
answered this in the affirmative but his data 
were not based upon identical items. Within 
the question of differential effectiveness of 
ways of controlling for SD, we were interested 
in comparing first, check list and Q sorts; 
second, within the Q sorts the effect of vary- 
ing the degree of comparability of the MSD 
values of items competing for endorsement. 

It is important to note that this study 
does not involve validation. It is a meth- 
odological study concerned with elucidating 
characteristics of the rating scale ferms so 
that some practical decisions can be made as 
to which form is likely to produce variable 
scores least confounded with SD and also 
provide a measure of SD. 


METHOD 


A number of articles (La Forge & Suczek, 1955), 
in the literature dating back to 1951 described the 
construction and application of an 128-item check 
list emphasizing interpersonal rather than intra- 
personal variables. An interesting characteristic of 
this interpersonal check list was that its construc- 
tion imparted some equivalence of the MSD 
values to its variables. It was therefore used in 


1 Copies of the forms can be obtained on request 
from William S. Kogan, Veterans Administration 


Hospital, Seattle, Washington. 
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the present study as a reservoir of items having both 
a face validity structure and some known empirical 
characteristics. From this reservoir of 128 items 
(eight variables, 16 items each) a subsample of 64 
items (eight variables, 8 items each), was constructed 
such that the sum and range of MSD values for 
each of the eight variables of the instrument were 
quite closely equated. SD values were obtained as 
described by Edwards (1957a, 1957b) on an in- 
dependent sample of patients. Table 1 gives the 
sum and range of MSD values for the items for 
each of the eight variables. 


Construction and Scoring of Rating Scale 
Forms 


These 64 items were used in constructing the 
four forms studied. Forms 1 and 2 each consisted 
of four subsets of 16 items each, 2 from each vari- 
able. In Form 1, each subset of 16 items contained 
a restricted and comparable level of item SD values. 
Form 2, each subset had a range of item SD values, 
selected such that each of the eight variables had 
one comparably high and one comparably low SD- 
valued item. In Forms 1 and 2, subjects sorted 
each subset of items separately into a forced normal 
distribution. The forced normal distribution was 
2, 3, 6, 3, 2. Variable scores were calculated by 
summing the item placements across the four sub- 
set sortings. Form 3 consisted of a single 64-item Q 
sort. The forced normal distribution was 2, 5, 7, 
11, 14, 11, 7, 5, 2. Form 4 consisted of a check list 
form in which the items were listed on a page and 
subjects simply checked each item as being char- 
acteristic or uncharacteristic of the self. Table 2 
defines the term “level” employed in the text with 
reference to Forms 1 and 2. 

It will be noted that Forms 1, 2, and 3 offer, 
respectively, an increasing opportunity for items of 
differing MSD value to compete with each other. 
Form 4 provides an opportunity to compare the 
Q sort, or forced-choice forms, with the more com- 
mon inventory form. 

In addition to yielding scores on each of the eight 
variables, Forms 2, 3, and 4 yield SD scores. These 
are calculated by summing the degree of endorse- 


TABLE 1 


VARIABLES OF INTERPERSONAL CHECK LisT AND SuM 
AND RANGE OF MEDIAN SD VALvEs oF ITEMS IN 
VARIABLES 


Variable 


Name 
Managerial—Autocratic 
Competitive—Exploitive 
Blunt—Aggressive 
Skeptical—Distrustful 
Modest—Self-effacing 
Docile—Dependent 
Cooperative—Overconventional 
Responsible—Overgenerous 


Sum 
39.28 


Range 
2.21-7.59 
2.33-7.25 
2.38-7.76 
2.54-6.23 
2.50-7.31 
2.85-7.13 
2.94-7.47 
3.67-7.33 


TABLE 2 
MEAN AND RANGE MEDIAN SD oF LEVELS 


Level Mean 


High 
Middle high 
Middle low 
Low 


All 


ment of the 32 items (four from each variable) 
having the higher MSD values. In Form 4, the 
SD score is simply the number of these 32 items 
endorsed. The SD score becomes a variable to be 
correlated with other test variables. The average 
intercorrelation of the SD scores on the three forms 
was .76. In this study the SD score obtained from 
Form 3 was arbitrarily selected as the one to be 
correlated with variable scores from all forms. 


Subjects and Administration 


The subjects were 39 male veterans hospitalized 
on a psychiatric service of a General Medical and 
Surgical Hospital. The ages of the patients ranged 
from 19 to 69 with a mean of 39.8. 

The inventories were administered to the subjects 
in groups on four occasions, each 2 days apart, and 
counter balanced for order of presentation. The in- 
structions to the subjects were “describe yourself.” 


RESULTS 
Comparability of Forms 


The question of comparability of the forms, 
aside from the issue of control of SD, is 
reflected in the intercorrelations of variables 
among the four forms. These intercorrela- 
tions represent the mean correlation between 
eight variables on two forms: for example, 
PA Form 1 with PA Form 2, etc. The mean 
intercorrelations ranged between .63 and .70; 
for N = 39, correlation of .40 is significantly 
different from 0 at the 1% level of confidence. 

The level of the mean intercorrelations 
indicates a common pool of behavioral at- 
tributes are being tapped by the four forms. 
As a “comparable form” measure of test 
reliability, these mean intercorrelations ap- 
pear to fall within the lower limits of such 
intercorrelations obtained generally from per- 
sonality inventories. The homogeneity of the 
mean intercorrelations shows clearly that no 
one form departs importantly in pattern of 
tests results from the other forms. 


27 
| 
7.0 5.6-7.8 
: 6.2 5.0-7.3 
3.6 2.3-4.7 
2.9 2.2-3.7 
|__| 49 2.2-7.8 
bg 
Us 
q 
| PA 
BC 39.67 
+? DE 39.07 
FG 37.29 
HI 39.35 
sp JK 38.64 
LM 42.60 
NO 41.70 
| 


WitiiAm S. KoGAN AND WILBERT E. Forpyce 


TABLE 3 
{FREQUENCY OF SIGNIFICANT ¢ TESTS 
BETWEEN FORMS BY VARIABLE MEANS 


Form 


Form 2 3 4 


Significance 
level 


2 1 2 
1 0 3 


1 1 


Comparison of Control of SD in Four Forms 


The problem of the degree to which vari- 
able scores were confounded with or related 
to the SD factor was analyzed in three ways. 
First, each subject’s SD score, as calculated 
from Form 3, was correlated with his variable 
scores on all four forms. The results indicate 
that no one variable on any of the forms is 
heavily confounded with SD. Only one of the 
32 correlations involved reached a 1% level 
of significance. The mean and range of these 
correlations is comparable to those given in 
the manual of the Personal Preference Sched- 
ule (Edwards, 1953a). From these findings it 
is inferred that the methods of controlling 
SD used here are generally comparable and 
that they are as effective as the forced choice 
method used by Edwards. 

While the correlational analyses so far re- 
ported indicate comparability of forms on 
proportional measures, mean variable scores 
could still be differentially affected by or 
related to SD. Analysis of variance and ¢ 
tests were used to analyze the comparability 
of mean variable scores among the test forms. 
First, the means and distributions of variable 
scores were transformed to a common base, 
T scores. This was done by calculating the 
grand mean and standard deviation of the 
eight distributions of 39 scores each, obtained 
on each form. From these values, individual 
variable score distributions were transformed 
to T scores with a mean of 50 and standard 


deviation of 10. This was repeated on each 
form. Next, subjects were divided into High, 
Middle, and Low SD groups of equal N’s 
(13 in each group) according to their SD 
scores obtained on Form 3. Eight analyses 
of variance were carried out, one for each 
variable; three levels of SD, by four forms. 

These analyses reveal few significant find- 
ings between forms, between SD groupings, 
or on the interaction of these two factors. 
Generally speaking, mean variable scores ob- 
tained from each of the test forms appear to 
be comparable estimates of the parameter 
from which they were drawn. Similarly, sepa- 
rating subjects according to their SD scores 
fails to reveal systematic differences with 
respect to mean variable scores across the 
four forms. 

Despite the findings of the analyses of 
variance, the possibility remained that there 
may have been a tendency for one of the four 
forms to have a systematic rank position with 
regard to the other forms on mean variable 
scores. Analysis of the four together might 
obscure this trend. To explore this further, 
a more detailed analysis of mean variable 
scores was carried out by comparing each 
form with every other form by ¢ test on each 
of the eight variables. The results of this 
analysis are summarized in Table 3. 

It should be noted that if the four forms 
yield comparable mean variable scores, what- 
ever significant mean differences occur, as 
measured by ¢ test, should be approximately 
equally distributed among the four forms. 
The data given in Table 3 indicate this is not 
the case. A total of 48 tests were calculated, 
24 involving each of the four forms. Table 3 


TABLE 4 


COEFFICIENT OF CONCORDANCE 


SD group 


Forms Low 


High Middle 


1 a 13 
2 
3 ase 
4 .19** 


*5% level of significance. 
** 1% level of significance. 
*** 1% level of significance. 


28 
05 
O1 
05 
A 2 
0 3 1 
2 05 
3 
3 01 | 
‘ 
| 
f 


CoNTROL OF SoctAL DESIRABILITY 29 


shows that a total of 19 ¢’s were significant, 
9 at the 5% level, 10 at the 1% level. Of 
these, 14 involved Form 4 (5 at 5% level, 9 
at 1% level), whereas only five were signifi- 
cant among the other three forms and four of 
these at only the 5% level. Both the number 
of significant ¢ tests and their distribution 
among the forms obviously departs from 
chance. Here then is clear evidence that so far 
as these conditions of measurement are con- 
cerned the four test forms studied show sys- 
tematic differences. The one form, Form 4, 
yields mean variable scores differing signifi- 
cantly from the other three forms. These 
other forms show comparable means. Our 
data indicate that these differences between 
Form 4 and the other forms are most prob- 
ably related to the SD factor. That is to 
say that Form 4 is the most susceptible to 
being influenced by variations in the balanc- 
ing for the SD variable. 

The comparability of Forms 1, 2, and 3 
is again evident from the data given in Table 
3. These data support the conclusion that in 
Q sort-type measuring instruments, SD is con- 
trolled equally well when items of equal or 
varying SD values are permitted to compete 
for endorsement, so long as there is an overall 
balance between item pools for each variable. 

The final method of analysis was to eval- 
uate similarity of test profiles across the eight 
variables by the coefficient of concordance W 
(Kendall, 1948). Comparisons were made for 
each of the High, Middle, and Low SD 
groups, form by form. These data are pre- 
sented in Table 4. 

The findings given in Table 4 show that all 
tests forms yield a significant amount of pro- 
file similarity for the Middle SD subjects but 
that only Form 4 shows homogeneity of pro- 
files, though only slightly so, for the High 
and Low SD subjects. This indicates that in 
profile analysis too the check list form is 
slightly different from the Q sort forms. 


DISCUSSION 


The various methods of analysis reported 
here show consistently that the three Q sort 
forms studied yield highly comparable data. 
As noted above, these findings indicate SD 
is controlled equally well when comparable 
or varying SD item values compete. Among 


the three forms of Q sort used here, Forms 2 
and 3 appear to have an advantage in that 
they permit calculation of an SD score in- 
dependently of the variable scores. These SD 
scores can be used as an independent variable. 

Although the data reveal some differences 
between the Q sort and check list forms, these 
differences are slight. There is little data here 
to support the conclusion that the check list 
form produces clearly different test results in 


_ a practical sense. That differences, however 


slight, occur between Form 4 and the other 
three forms only in those variables less com- 
parable with respect to MSD, supports the 
inference that SD item values affect variable 
scores and that Form 4, the check list, is more 
susceptible to this effect. 


SUMMARY 


1. The potential utility of unconfounding 
rather than removing SD as a source of 
variance in personality inventories was de- 
veloped. 

2. Four forms replicating the same items 
drawn from an interpersonal check list were 
constructed to compare the effect of items of 
differing levels of SD competing for endorse- 
ment and to compare three variations of Q 
sort construction with each other and with a 
check list. 

3. The data support the conclusion that 
equal or varying item SD levels yield com- 
parable results in variable scores and in con- 
trol of SD. 

4. The check list form was shown to differ 
slightly but consistently from the Q sort 
forms in mean variable scores and in homo- 
geneity of subject profiles. It was concluded, 
however, that these differences were so slight 
as to indicate the check list form has practical 
equivalence. The check list form was demon- 
strated to be more susceptible to minor varia- 
tions in the balancing process. 


REFERENCES 


Brock, J. A comparison of ipsative and normative . 
ratings of personality. J. abnorm. soc. Psychol. 
1957, 54, 50-54. 

Broen, W. E., Jr., & Wirt, R. D. Varieties of re- 
sponse sets. J. consult. Psychol., 1958, 22, 237-240. 

Cronpacu, L. J. Response sets and test validity. 
Educ. psychol. Measmt., 1946, 6, 475-494. 


ra 
al 
: f hy 
Ap 
ae 
|, 


30 WrtiaM S. KoGAN AND WILBERT E. Forpyce 


Epwarps, A. L. Manual for the Edwards Personal 
Preference Schedule. New York: Psychological 
Corporation, 1953. (a) 

Epwarps, A. L. The relationship between the judged 
desirability of a trait and the probability that the 
trait will be endorsed. J. appl. Psychol., 1953, 37, 
90-93. (b) 

Epwarps, A. L. Social desirability and probability 
of endorsement of items in the interpersonal check 
list. J. abnorm. soc. Psychol., 1957, 55, 394-396. (a) 

Epwarps, A. L. The social desirability variable in 
personality assessment and research. New York: 
Dryden, 1957. (b) 

Forpyce, W. E. Social desirability in the MMPI. 
J. consult. Psychol., 1956, 20, 171-175. 


Kenpatt, M. G. Rank correlation methods. London: 
Charles Griffin, 1948. 

Kocan, W. S., Quinn, R. D., Ax, A. F., & Ripey, 
H. S. Some methodological problems in the quan- 
tification of clinical assessment by Q-array. J. 
consult. Psychol., 1957, 21, 57-62. 

La Force, R., & Suczex, R. F. The interpersonal 
dimension of personality: III. An interpersonal 
check list. J. Pers., 1955, 24, 95-112. 

Meeut, P. E., & HatHaway, S. R. The K factor 
as a suppressor variable in the Minnesota Multi- 
phasic Personality Inventory. J. appl. Psychol. 
1946, 30, 525-564. 


(Received October 31, 1960) 


° 
@ 
3 
Ad 
. 
. 
5 = 
ihe 
te 
1 
| 
} 
| 
4 


Journal of Consulting Psychology 
1962, Vol. 26, No. 1, 31-37 


A COMPARISON AND REINTERPRETATION OF 


FACTOR STRUCTURES OF THE W-B, THE 
WAIS, AND THE HAWIE’' ON 


A number of factor analyses of the 
Wechsler Intelligence Scales (WAIS, W-B, 
and HAWIE) on adult or aged subjects have 
been conducted (Balinsky, 1941;  Birren, 
1952; Cohen, 1957; Goldfarb, 1941; Mundy- 
Castle, 1958; and Riegel, 1960). The results 
and their interpretations, however, have been 
contradictory. Birren identified four factors 
in the W-B responses of 60- to 74-year-old 
persons. These factors differed markedly from 
those reported by Cohen (1957) for almost 
the same age group (60 to 75+ years) on the 
WAIS. They corresponded closely, however, 
to Cohen’s factor structures* of the age 
groups 18 to 19, 25 to 34, and 45 to 54 years. 
Cohen himself found that the factor structure 
for the oldest group differed noticeably from 
those for the three younger age groups and 
used this result as indication of changes in 
factor structure during later years of life. 
In light of Birren’s findings this conclusion 
can be maintained only, if the discrepancy 
between W-B and WAIS factor structures of 
the age group 60 years and older cannot be 
clearly attributed to differences in the forms 
of the tests, and/or to particular differences 
in the techniques of rotation employed. Ac- 
cordingly, the following problems suggested 


HAmburg-Wechsler-/ntelligenztest fiir Erwachsene. 

2Section on Aging, Laboratory of Psychology, 
National Institute of Mental Health, Bethesda, Mary- 
land. The preparation of this paper was completed 
by the senior author during the tenure of a Post- 
doctoral Fellowship from the Institute of Mental 
Health, United States Public Health Service. The 
authors are greatly indebted to J. E. Birren for his 
support and constructive criticism. 

3 The term factor structure refers to the configura- 
tion or pattern of factor loadings whereas changes 
in factor loadings that do not alter the interpreta- 
tion of the factors will not be called structural 
changes. 
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by the previous findings are to be recon- 
sidered: 


1. Does the factor structure of the Wechsler 
test, for the age group 60 years and older, 
vary among the different versions of the test? 

2. Are there changes in the factor structure 
of the Wechsler test with advancing age, i.e., 
changes which may occur independently of 
any growth and decline in test performance? 

Since recently the factor structure of the 
German version of the Wechsler test 
(HAWIE) has been made available our study 
is extended to a comparison which bears on 
the question: 

3. Are there cultural differences between 
the factor structures of the Wechsler tests for 
the age group 60 years and older? 


Factor analysis is a most useful tool where 
a large number of interrelated measures is 
to be reduced to a few underlying factors, a 
linear combination of which represents any of 
the measured variables. Since the Wechsler 
tests consist of 10 subtests only (or 11 when 
the vocabulary is included) factor analyses of 
this scale are of limited value. The most one 
can expect is information on the stability of 
the test structure between age groups, popula- 
tions, and versions. One can hardly expect to 
learn much about the intellectual structure of 
older persons, since the number of subtests 
included in the battery is not large enough for 
such an analysis. 

For this reason the purpose of our in- 
vestigation is limited and represents mainly a 
critique and re-evaluation of studies in which 
such attempts have been made. A_ second 
purpose is the application of more appropriate 
methods for comparisons of different struc- 
tures. 


4 
ink 
3 
at 
a 
q 
4 
ot! 
4 
f 
am 
31 
1: 


MetTHop 


Tests 


While the W-B and WAIS are sufficiently known, 
brief remarks on the HAWIE have to be made. 
The HAWIE is a translation of the W-B. Only some 
test items have been slightly modified. The stand- 
ardization resembles that of Wechsler (1939) and 
was conducted by Anne Hardesty (Wechsler, 1956) 
on 1,831 persons of the age range from 10 to 60 
years. The standardization of the HAWIE was 
extended and partially repeated for the age groups 
55 to 75+ years by Riegel and Riegel (1959) and 
was based on the test results of 380 persons in 
Northern Germany. This extension resembles in 
many respects the standardization of the WAIS 
(Wechsler, 1955) for the older age groups by 
Doppelt and Wallace (1955). However, to most 
subjects only three or four alternate subtests were 
given, each combination allowing for an equally 
good estimate of the total test scores. The complete 
matrix of subtest intercorrelations was derived from 
a subgroup of 128 subjects to whom the whole 
test battery was administered and who closely 
resembled the total group of 380 subjects with 
respect to mean subtest performances. For a de- 
tailed discussion of the procedures applied, see 
Riegel and Riegel (1959). 


Techniques of Comparisons 


Thurstone’s (1947) method of rotation to simple 
structure is of particular value where psychological 
meaningful factor patterns are to be _ identified 
(Birren, 1952; Goldfarb, 1941; Mundy-Castle, 1958). 
However, when the main goal of the analysis is 
to compare different factor structures with each 
other, this technique is not satisfactory because no 
objective criteria are available to indicate when 
simple structure is attained. It is possible, therefore, 
that differences between factor matrices which have 
been rotated to simple structure are not inherent 
in the data but are merely dué to differences in 
opinion as to when simple structure has been 
obtained. Among the methods available to yield 
comparable factor matrices are: Hotelling’s prin- 
cipal components method (1933), and Ahmavaara’s 
transformation analysis (1957). 

The value of Hotelling’s method lies in the fact 
that it yields unique solutions and thus enables 
the experimenter to compare the factor (component) 
structures of different groups. In an earlier investiga- 
tion by one of us (Riegel, 1960) this method did 
not allow for a reliable identification of more than 
two factors (components), since these accounted 
for 65% of the total and for 90% of the common 
variance. In the studies under consideration more 
detailed factor matrices have been reported. In 
order to compare these matrices a method such as 
the transformation analysis of Ahmavaara has to be 
applied. 

The method of Ahmavaara allows for a rotation 
of one matrix in such a way that it corresponds 
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most closely to the matrix of comparison. The trans- 
formation matrix is obtained by the following 
formula: 


L= 1) iy) 


Here, L denotes the transformation matrix, which 
has to be normalized by rows before its application. 
F; and Fy denote the factor matrices under con- 
sideration. Matrices rotated to simple structure 
(even to a slightly oblique one) may be compared 
by this method. 

Ahmavaara demonstrated the correspondence be- 
tween two transformed factor matrices by plotting 
all factor loadings of the two matrices against each 
other, and suggested the following index: 


Fy Ly 
N 


A= 


This index, however, would falsely indicate a low 
congruence, if the factor loadings of one matrix are 
systematically lower than those of the other, but 
do not differ relative to each other. Because of 
this shortcoming Pearson’s coefficient of correlation 
rather than Ahmavaara’s index was used in our 
analysis to indicate the relationship between the 
different factor matrices. 


RESULTS 


The three questions formulated above were 
tested by comparing a number of different 
factor structures reported in the recent litera- 
ture. The factor matrix of the HAWIE was 
in each case the standard matrix of compari- 
son. For each comparison a transformation 
matrix, Z, was obtained. The matrices result- 
ing from the multiplication of. the factor 
matrix of the HAWIE with the particular 
transformation matrices, 1, are supposed to 
correspond closest to the other factor matrices 
under consideration. 

In particular, the HAWIE was compared 
with the factor structure of the WAITS 
(Cohen) and the W-B (Birren). In addition, 
an unrotated factor matrix of the WAITS for 
Cohen’s age group of 70 to 74 years was 
derived by the authors. This seemed advisable 
since Cohen himself had used an oblique 
rather than orthogonal rotation. Finally, an 
unpublished factor analysis of the WAIS for 
the age groups 20 to 74 years was made avail- 
able by Birren to the authors and thus the 
following structures were tested for similarity: 


1. Factor structure of the HAWIE for the 
age group 65 to 75+ years (Riegel, 1960). 
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TABLE 1 
Factor Matrix or tHe HAWIE Arrer 
ORTHOGONAL ROTATION 


Subtests 


General Information 
Comprehension 

Digit Span 

Arithmetic 

Similarities 

Coding 

Picture Arrangement 
Picture Completion 

Block Design 
Object Assembly 


Note.—All Subjects 65 to 75 years old. 


Five centroid factors were extracted, and 
orthogonally rotated as shown in Table 1. 

2. Factor structure of the W-B for the age 
group 60 to 74 years (Birren, 1952). Four 
factors were extracted and rotated to a 
slightly oblique simple structure. In Table 2 
these factors are compared with the cor- 
responding factors of the HAWIE, the latter 
being rotated to closest similarity with the 
first. The degree of similarity is revealed by 
a correlation coefficient of r = .92. 

3. Factor structure of the WATS for the age 
group 60 to 75 years (Cohen, 1957). This 
analysis is based on the data of Doppelt’s 
(Doppelt & Wallace, 1955) extension of the 
standardization. The four factors identified 
after rotation to an oblique simple structure 
are compared in Table 3 with the correspond- 


ing factors of the HAWIE, the latter being 
rotated to closest similarity with the first. The 
degree of correlation is r = .66. 

4. Factor structure of the WAIS for the 
age group 70 to 74 years (Riegel, present 
publication). The factor analysis is based on 
the correlation matrix reported by Doppelt 
and Wallace (1955). Four centroid factors 
were extracted by the present authors and 
are compared in Table 4 with the correspond- 
ing factors of the HAWIE, the latter being 
rotated to closest similarity with the first. 
The degree of correlation is r = .93. 

5. Factor structure of the WAIS for the 
age groups 20 to 74 years (Birren, un- 
published). This factor analysis is based on 
the standardization sample of the test 
(Doppelt & Wallace, 1955; Wechsler, 1955). 
Age and education were included as variables. 
The centroid factors were rotated to an 
orthogonal structure and are compared in 
Table 5 with the corresponding factors of the 
HAWIE, the latter being rotated to closest 
similarity with the first. The degree of cor- 
relation is r = .96. 

All correlation coefficients summarized be- 
low are significantly different from zero (p 
< .001). The correlations do not differ sig- 
nificantly from each other (p > .10), except 
when compared with the correlation on the 
HAWIE and WAIS by Cohen (p < .001). 
These interpretations have to be accepted 
with some precautions, however, since the 
sampling distributions of such correlations 


TABLE 2 


COMPARISON OF THE TRANSFORMED HAWIE Factor Matrix with tHe W-B 
Factor Matrix OF BrrrEN Hotpinc Rotation CONSTANT 


\ 


Subtests 


HAWIE W-B 


General Information 95 
Comprehension 88 
Digit Span 65 
Arithmetic 80 
Similarities 7 
Coding 76 
Picture Arrangement 73 
Picture Completion 78 
Block Design 76 
Object Assembly 54 


HAWIE 
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W-B HAWIE W-B HAWIE 
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TABLE 3 


COMPARISON OF THE TRANSFORMED HAWIE Factor Matrix wit tHE WAIS Factor 


MATRIX OF COHEN HoLpING ROTATION CONSTANT 


A 


Subtests 


B G 


HAWIE HAWIE WAIS 


General Information 
Comprehension 

Digit Span 
Arithmetic 
Similarities 

Coding 

Picture Arrangement 
Picture Completion 
Block Design 

Object Assembly 


.10 
—.10 
03 
02 
.20 
10 
38 
00 
03 


11 
37 
32 
35 
- 09 
07 


— 
01 
09 
.06 
10 
.29 
.20 
04 
58 
50 


Al 
38 
45 
52 
10 
21 
09 
O1 
09 
03 


A3 
20 
.26 
.23 
38 
AS 
.22 
58 
AS 
34 


among transformed factor matrices are un- 
known. 


HAWIE: W-B by Birren 

HAWIE: WAIS by Cohen 
HAWIE: WAIS by Riegel 
HAWIE: WAIS by Birren 


INTERPRETATION 


The first question on differences in factor 
structures between different versions of the 
test, can be answered by comparing the 
correlation between HAWIE and W-B (Bir- 
ren) with that between HAWIE and WAIS 
(Cohen). The latter is significantly lower 
than the first, however, it differs also signifi- 
cantly from the other correlations between 
HAWIE and WAIS (Birren, Riegel). Thus, 


it seems reasonable to conclude that the de- 
viation of the factor structure of the WAIS 
(Cohen) from the other factor structures, is 
due to differences in opinion as to when 
simple structure is obtained and to the rota- 
tion procedures applied rather than to dif- 
ferences in the different versions of the test 
and inherent in the data. 

Since no comparisons with the factor struc- 
tures of younger age groups have been in- 
cluded in the present analysis the second 
question on changes in factor structure with 
advancing age, as recently proposed by 
Cohen (1957) and earlier by Balinsky 
(1941), Garrett (1946), Spearman (1927) 
and others, can be answered tentatively only. 
Relying on our first conclusion, it is again 


TABLE 4 


COMPARISON OF THE TRANSFORMED HAWIE Factor Matrix THE WAIS Factor 
Matrix oF RreGeEL ROTATION CONSTANT 


HAWIE WAIS 


Subtests 


HAWIE 


B 


D 


WAIS HAWIE WAIS 


WAIS HAWIE 


General Information 
Comprehension 
Digit Span 
Arithmetic 
Similarities 

Coding 

Picture Arrangement 
Picture Completion 
Block Design 
Object Assembly 
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TABLE 5 


COMPARISON OF THE TRANSFORMED HAWIE Facror Matrix with THE WAIS Factor 
Matrix OF BrrrREN HoLpING ROTATION CONSTANT 


Subtests HAWIE WAIS 


General Information 
Comprehension 
Digit Span 
Arithmetic 
Similarities 

Coding 

Picture Arrangement 
Picture Completion 
Block Design 

Object Assembly 


quite possible that the differences in factor 
structures between age groups as reported by 
Cohen (1957) are due to the procedures used 
and not inherent in the data. This is espe- 
cially likely since there is good correspond- 
ence in factor structure between Cohen’s 
younger age groups and those reported for 
old subjects by Birren (1952) and Riegel 
(1960). Thus, only Cohen’s old age group 
deviates in factor structure from his younger 
groups. These comparisons have not been 
made, however, by applying Ahmavaara’s 
technique. 

Further evidence to the second question 
can be derived from Birren’s unpublished 
study of the WAIS in which subjects ranged 
in age from 20 to 74 years. The factor struc- 
ture of this larger age group deviates as little 
from that of the HAWIE for the age group 
65 to 75+ years (r= .96), as the factor 
structure of the WAIS (Riegel) for the age 
group 70 to 74 years (r = .93). Finally, also 
the factor structures of the HAWIE as deter- 
mined by using Hotelling’s principal compo- 
nent method did not reveal structural dif- 
ferences between the age groups 20 to 34, 35 
to 49, 50 to 64, and above 65 years (Riegel, 
1960). 

In answering the third question on dif- 
ferences in factor structure between different 
populations (American and German) we have 
to rely again on the above correlations. In 
view of the high similarity between HAWIE 
and W-B (Birren) as well as between HAWIE 


HAWIE WAIS 


HAWIE WAIS HAWIE WAIS 


and WAIS (Birren and Riegel) it seems safe 
to conclude that no differences are revealed 
by administering the Wechsler test to dif- 
ferent populations. 

Summarizing these findings, our conclusions 
can be considerably simplified. All one can 
conclude is that there are no differences in 
factor structures between different versions of 
the Wechsler test, between different popula- 
tions and seemingly between different age 
groups. The only deviation in factor structure 
of the WAIS as reported by Cohen seems to 
be due to the rotation procedures applied and 
possibly to differences in opinion as to when 
simple structure is obtained. 


DISCUSSION 


The results suggest that the methods of 
extraction and rotation influence the factor 
structures. In our opinion, the criteria for 
simple structure are not sufficient to determine 
factor structures that are to be compared with 
each other. Differences noted may be pri- 
marily due to differences in opinion as to 
when simple structure is obtained, rather than 
to differences inherent in the data. In par- 
ticular, this will be true if factors are rotated 
to an oblique rather than orthogonal struc- 
ture, since here additional subjective clues 
are influencing the decisions that have to be 
made. Generally, greater precaution ought to 
be observed in the interpretation of factors 
but especially of differences between factor 
structures. The transformation analysis seems 
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to be a useful analytic method to warrant 
comparability and thus represents a means 
of integrating the multitude of factor analyt- 
ical findings. The methods of rotation by 
electronic computers may also serve this 
purpose. These methods yield unique solu- 
tions but only approximations of simple 
structure. 

It is unlikely that differences in factorial 
structures of the Wechsler tests between age 
groups can be detected. Only few factors 
were identified which represent the most 
general characteristics of the test such as 
general intellectual, verbal-performance, nu- 
mercial, or speed aspects. As revealed by an 
earlier analysis (Riegel, 1960) the most one 
can detect in such comparisons are differences 
in factor loadings of subtests and thus cues 
about changes in their diagnostic value. Since 
it has to be also realized that each subtest is 
composed of a number of different variables, 
more distinct results for a psychology of 
aging may be obtained by analyzing the 
subtests themselves taking the item inter- 
correlations into account. 

Generally, the failure to find differences in 
factor structures between different groups and 
conditions may be due to restrictions in the 
range of abilities sampled by the Wechsler 
tests. Accordingly, one common factor ac- 
counts for the major part of the variation 
and little variability is left to more specific 
factors that could be expected to differ be- 
tween groups and conditions. Our principal 
component analysis (Riegel, 1960) yields 
clear evidence to this point. 

No cultural differences in the factor struc- 
tures of the Wechsler tests were noticed. 
Thus, in spite of some changes in items the 
main features seem to have been maintained in 
the German version of the test. However, 
more detailed investigation of cultural dif- 
ferences should be based again on item statis- 
tics because some subtests of the W-B, the 
WAIS, and the HAWIE differ in length from 
each other. Crude comparisons of the rate of 
growth and decline have been made between 
the two populations by using standard scores, 
ie., expressing the mean subtest scores within 
one age group in terms of the highest average 
achievements for each subtest, namely the 
achievements of the age group 20 to 34 years. 
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These comparisons on the W-B and HAWIE 
(Riegel, 1958) and on the WAIS and HAWIE 
(Birren, 1959) have not shown any marked 
differences between the American and Ger- 
man populations for the age range from 20 
to 75+ years. 


SUMMARY 


Factor analyses conducted on the WAIS 
and W-B (age groups 60 to 75+) have in- 
dicated: (a) WAIS factor structure (Cohen) 
differs from W-B factor structure (Birren). 
(6) WAIS factor structure of the age group 
60 to 75+ differs from those for the younger 
age groups (Cohen). In the present study 
these findings were reconsidered, applying 
more suitable methods for comparisons 
(Ahmavaara). (c) W-B and WAIS were com- 
pared with HAWIE factor structure in order 
to analyze cultural differences. 

The results failed to show: (a) Differences 
in factor structure between different versions 
of the test for the age groups above 60. (+) 
Sufficient evidence for changes in factor struc- 
tures with advancing age. (c) Differences in 
factor structures of the American and German 
populations. 
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SIMILARITY AND SUCCESS IN THERAPEUTIC DYADS' 


ROBERT C. CARSON 


Duke University 


Despite repeated reference in the literature 
of psychotherapy to the importance of the 
nature of the patient-therapist relationship in 
determining the course and outcome of 
psychotherapy, extremely few studies on the 
problem have been reported. Many investi- 
gators have sought to relate various person- 
ality variables of patients to success in 
therapy; some, but many fewer, similar 
studies have been made of therapists. Studies 
of patient-therapist “fit” as it affects out- 
come, however, have rarely been undertaken. 
The present study is of this dyadic type, in 
that it is concerned with the effect of patient- 
therapist personality similarity upon thera- 
peutic success. 

The relationship between general patient- 
therapist personality similarity and thera- 
peutic success has been studied previously by 
Axelrod (1951), Tuma and Gustad (1957), 
and Schopler (1958) using various definitions 
of “similarity” and “success.’’ While the 
results of none of these studies are unequivo- 
cal, only Schopler failed to find evidence for 
a positive relationship. Heine and Trosman 
(1960) and Orlinsky (1959) report a positive 
correlation between similarity of patient and 
therapist expectations of the relationship and 
duration of therapy. Vogel (1959) obtained 
partial confirmation of his hypothesis that 
similarity with respect to authoritarian be- 
havior and attitudes of patient-therapist 
pairs would enhance their degree of satisfac- 
tion with the relationship. In several unpub- 
lished studies, however, the present authors, 
working on the seemingly well-founded as- 
sumption that global patient-therapist per- 
sonality similarity and therapeutic success 
would be related in linear fashion, obtained 
generally disappointing results. A subsequent 
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rethinking of the problem led to a further 
analysis of the data from a different stand- 
point. 

We reasoned that if, as has often been 
suggested, psychotherapeutic success depends 
upon the therapist’s being able to achieve an 
optimum balance between empathy and ob- 
jectivity in dealing with his patient, then 
the relationship between patient-therapist 
similarity and therapeutic success might well 
be of curvilinear form. With very high simi- 
larity the therapist might be unable to main- 
tain suitable distance and _ objectivity, 
whereas in the case of great dissimilarity 
he would not be able to empathize with or 
understand the patient’s problems; in either 
case one might expect a decrement in thera- 
peutic success. Formally stated, our hypothe- 
sis in the present study was as follows: As 
patient-therapist personality dissimilarity in- 
creases through a range from maximum to 
minimum congruence, there will be a corre- 
sponding increase in therapeutic success up 
to a certain (unspecified) point, beyond 
which further increases in dissimilarity will 
be associated with decreasing success of the 
therapeutic relationship.” 


METHOD 


Senior medical students at the University of Chi- 
cago School of Medicine are required as part oi 
their training in psychiatry to take an 18-week 
course which consists primarily of treating psycho- 
therapeutically, under supervision, a selected patient 
who has been referred to the Psychiatry Clinic. Our 
subjects were patients and therapists drawn from 
this program during four successive academic quar- 


* The curvilinear 
not obvious in the few scatter plots originally 
constructed from more limited data. The specific 
hypothesis tested was not a direct product of these 
original data analyses, but was temporally prior to 
the analysis upon which this paper is based. 
Schopler (1958), whose work came to our at- 
tention after the completion of the present study, 
had proposed this hypothesis originally in his study. 
However, his findings led him subsequently to 
hypothesize a relationship of a rather different form, 
based upon certain tenuous trends in his data. 


form of the relationship was 
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ters. From a total population of some 70 patient- 
therapist pairs, we were able to collect all data 
reievant to this study on 60 cases. All patients were 
told that their treatment would be limited to 18 
weeks, at the end of which period they would either 
be terminated or referred elsewhere. With rare ex- 
ceptions, patients were seen once a week for their 
psychotherapy interviews. 

The patient sample contained 35 females and 25 
males, with an average age of about 38 years. The 
range of disorders for which the patients had sought 
treatment is probably not atypical for an adult 
psychiatric out-patient clinic, with the exception that 
psychotic disorders probably were underrepresented 
by virtue of the program’s selection process. With 
very rare exception neither patient nor therapist 
had had any prior direct experience with formal 
psychotherapy. The therapists, of whom eight were 
female, were unselected except for their status as 
fourth-year medical students. 

The MMPI was used to similarity of 
personality between patient and therapist. All pa- 
tients and therapists were given the booklet form 
of the MMPI at the beginning of treatment. Using 
fairly liberal validity scale criteria (1 < 70, F < 80, 
K < 75), no profile in either group was considered 
invalid. Although there are many possible methods 
of deriving global indices of similarity between 
two MMPI profiles (Cronbach & Gleser, 1953), 
Hathaway and Meehl, after extensive investigations 
of the relative validity of various methods, con- 
cluded that simple rank-order correlation of the two 
sets of T scores is probably the most efficient index 
available. This was the method of similarity assess- 
ment chosen for the present study. It is important 
to note that this method will yield a measure of 
similarity of profile shape exclusively, since the 
variance associated with elevation and scatter is 
discounted in the computational process. In terms of 
Cronbach’s (1958) component analysis of dyadic 
similarity measures, profile means and standard 
deviations are automatically equalized, and only the 
residual correlational component determines the 
magnitude of the similarity index. 

The specific operations used in computing the 
similarity index were as follows. All profiles were 
K-corrected in the usual manner at the outset. The 
Mf T score for each female subject (whether a 
patient or one of the eight female therapists) was 
transformed into a 7 score representing the same 
amount of deviation from the standardization mean 
as had been originally obtained, but in the opposite 
direction. Thus a high Mf T score for a female 
subject, instead of representing a deviation in the 
direction of excessive “masculinity,” now  repre- 
sented a deviation in the direction of excessive 
“femininity.” This procedure had the effect of 
stabilizing the meaning of the Mf score for both 


assess 


3 Hathaway, S. R. Personal communication. July 
9, 1959. It may be noted that the emphasis given 
to profile shape by this index is in keeping with a 
trend toward emphasis on shape in the clinical in- 
terpretation of MMPI profiles. 


males and females. The ? validity score was dis- 
carded as superfluous because of the virtual absence 
of variability in this score in our sample of pro- 
files. T scores for all other scales appearing on the 
standard profile form, including L, F and K, were 
ranked ior all patients and therapists. For each 
patient-therapist pair, differences between ranks 
for each of the 13 scales were obtained by subtrac- 
tion and squared. The index of similarity was the 
sum of these squared rank differences for each 
pair.4 These indices, which, of course, are inversely 
proportional to degree ot profile similarity, ranged 
from 31.5 to 614.5 in the 60 pairs. 

The criterion for assessing success of therapy in- 
volved a composite of several outcome ratings made 
at the end of the period of treatment by the super- 
vising psychiatrist on each case. A total of eight 
senior staff members served as supervisors in the 
program and were assigned to patient-therapist 
pairs on an unsystematic basis. The supervisors were 
asked to make the ratings as part of a questionnaire 
concerned with a variety of variables under in- 
vestigation in the larger project of which this study 
was a part. So far as is known, none had any 
knowledge of the MMPI profiles of either the 
patients or the therapists whose cases they super- 
vised, nor were any aware that their ratings would 
be used to test the hypothesis which is the subject 
of this paper. 

Ratings on five items comprise the composite 
criterion. Two of the items asked the supervisor to 
rate (a) “degree of satisfaction of the patient with 
outcome of therapy,” and (b) “your rating of the 
outcome of therapy” on nine-point scales. The 
remaining three items were of a more complex na- 
ture and asked for a rating of the patient’s status 
with respect to three important dimensions at both 
the beginning and end of therapy; 
were occupational 


these dimensions 
adjustment, (b) adequacy 
of interpersonal relations, and (c) symptomatic 
status. These items were arranged on a five-point 
scale from poor functioning to good, each of the 
intermediate as well as the end points having an 
appropriate descriptive word or phrase to delimit 
their meaning precisely. The score on each of these 
more complex items was taken as the “end” rating 
minus that of the “beginning,” thus providing a 
measure of amount of positive change. Pearson r’s 
for the intercorrelations among all five items ranged 
from .32 to .73, the average value being 46. The 
overall “success” index was simply the algebraic 
sum of the scores on the above five items; it 
ranged from 2 to 20 points, with a mean of 11.63 
for the 60 cases. 

The therapists in the project 
complete the same research 
given to the supervisors. While it did not seem 
appropriate in the present study to use these 
therapist ratings as criteria of success, they do pro 
vide some basis for evaluating the reliability of the 
supervisor’s ratings. Forty-eight of the 60 therapists 


were asked to 


questionnaire as was 


#Since N is constant, computation of rho would 
be superfluous 
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MEAN SUCCESS 


DISSIMILARITY 


Fic. 1. Mean composite success rating as a function 
of patient-therapist similarity index. 


provided ratings on all of the component items of 
the success index. The interrater reliability coeffi- 
cient of the global success score, based upon these 
48 cases, was .55; this is highly significant statisti- 
cally, although it obviously accounts for only a 
portion of the total variance of the ratings. 


RESULTS 


The 60 pairs were first categorized ac- 
cording to score on the MMPI similarity 
index. As a compromise to achieve a reason- 
able number of categories, each comprising 
a reasonable number of pairs, it was decided 
to establish five groups of 12 pairs each. The 
i2 pairs with the lowest indices (< 265) 
were arbitrarily designated Group I, the 
“very high” similarity group. The next 12 
pairs in order of increasing scores on the 
index (265-318) became the “moderately 
high” similarity group and were designated 
Group II. Group III, comprising the next 
12 pairs (indices = 319-386), was the “me- 
dium” similarity group. Group IV (‘‘moder- 
ately low” similarity; indices = 387-456) 
and Group V (‘very low” similarity; in- 
dices = > 456) were established in like 
manner. 

The mean therapeutic success score, based 
on supervisor’s ratings as described above, 
was computed for the pairs in each of the 
five groups; these means are presented 
graphically in Figure 1. It can be seen from 
Figure 1 that the form of the relationship 
between patient-therapist personality simi- 
larity and therapeutic success is curvilinear, 
thus supporting the hypothesis. With increas- 


ing dissimilarity there is at first an increase 
in therapeutic success, but with further in- 
creases in dissimilarity beyond an optimal 
point there is a precipitous decline in the 
rated success of the therapeutic relationship.° 
A simple analysis of variance was performed 
on these data, resulting in an F of 3.06, 
which is significant beyond the .05 level for 
4 and 55 degrees of freedom.* Supplementary 
¢ tests revealed that Group IV, the group 
having the highest mean success rating, was 
significantly superior in rated success to 
either Group I (¢ = 2.38; p < .05) or Group 
V (¢ = 3.90; p < .01) at the extremes of the 
similarity dimension. 

The interpretation of this finding, based 
as it is upon a global index of profile simi- 
larity, requires a cautious approach along 
the lines suggested by Cronbach’s (1958) 
carefully reasoned analysis of such measures. 
Particularly relevant in this context is the 
possibility that the obtained relationship 
could be accounted for wholly on the basis of 
profile variations in one or another of the 
two classes of subjects, patients or therapists. 
Under such a circumstance a dyadic interpre- 


tation would be nonparsimonious. Suppose, 


for example, that the MMPI profiles of 
therapists were to vary but little in shape 
from a “constant” pattern reflecting a de- 
fensive set, while the profiles of patients 
might show marked variability in terms of 
similarity to this “defensive” pattern; the 
profile similarity index would then be essen- 
tially a measure of patient defensiveness, 
which might itself vary with success in a 
curvilinear fashion. Note, however, that this 
reasoning requires that there will be sys- 
tematic variations in the shapes of profiles 
for patients at the various levels of patient- 
therapist similarity, and unless this con- 


5It might be noted that none of the five groups 
of patient-therapist pairs were markedly unsuc- 
cessful as judged from average success scores. Pos- 
sibly this results from a systematic bias of the 
raters to upgrade the success of the relationships 
they supervised. On the other hand, however, we 
have in fact been surprised by the unexpectedly 
high level of overall success obtained by patient- 
therapist pairs in this program. 

® This is equivalent to a correlation ratio (eta) 
of 43 for the regression of success scores on 
similarity. 
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dition were met one would be obliged to re- 
ject the proposed monadic interpretation of 
the results. In order to evaluate this kind of 
interpretation, as well as the monadic in- 
terpretation corresponding to the less prob- 
able case of therapist’s profiles varying in 
similarity to a “common” patient profile, the 
“average” profile for both patients and ther- 
apists in each of the five groups was con- 
structed from mean scores on each scale. 
These mean scores are presented in Table 1. 
The scales on each of these average profiles 
were subsequently ranked in the same manner 
used to compute the similarity indices. Pro- 
file shape, as represented in these rankings, 
appears to be strikingly similar across the 
five groups for patients as well as therapists. 
This impression was confirmed by computing 
a coefficient of concordance (W) for each of 
the two sets of ranked profiles; W was .88 
for the patients and .91 for the therapists 
(Maximum value = 1.00). We may con- 
clude, then, that neither the patients nor the 
therapists in general show systematic differ- 
ences in profile shape in the five groups. 

It might be observed that there are unex- 
pectedly substantial variations in elevation 
among the five mean profiles of patients. 
However, these variations do not appear to 
be related in any regular fashion to degree of 
patient-therapist profile similarity, nor is it 
easy to conceive of how they could account, 
on a monadic basis, for the results obtained. 
Relatively, patients in Group I, who have 
low profile elevation, had an unsuccessful 
therapy experience, while patients in Group 
IV, who also have low elevation, had a suc- 
cessful experience. On the other hand, 
patients in Group III, who have high eleva- 
tion, successful relationships, while 
patients in Group V, also having high eleva- 
tion, had unsuccessful relationships. Thus the 
data will not readily support a monadic in- 
terpretation of the results, and the view is 
justified that we are dealing with a truly 
dyadic phenomenon. 


DISCUSSION 


Perhaps the most obvious question con- 
cerning the results of this investigation is 
the extent to which they can be generalized. 
The patient-therapist relationships studied 


here were atypical in two respects: (a) the 
marked inexperience of the therapists, and 
(6) the time-limited (and necessarily goal- 
limited) orientation of the treatment. The 
question of generality is clearly an empirical 
one, calling for replication of the study in a 
more typical psychotherapeutic setting. It 
might be noted, however, that our findings 
parallel those of Fiedler (1958), who con- 
cludes from his work with task groups that 
group effectiveness is related in curvilinear 
fashion to the “psychological distance” of the 
group leader. Insofar as the task group and 
the therapeutic dyad are comparable there 
is a basis for assuming some generality of 
the phenomenon. Moreover, the fact that the 
therapists in this study were essentially un- 
trained suggests that the phenomenon is 
somehow basic to personal interaction in the 
therapeutic context, and that whatever are 
the effects of specific psychotherapeutic train- 
ing they must operate with the personality 
similarity factor as a baseline. While it seems 
unlikely that the acquisition of psychothera- 
peutic knowledge and skills could render the 
similarity variable totally irrelevant, the 
extent to which training would alter the pat- 
tern obtained of course remains a funda- 
mental question. For the present, in any 
case, it can be stated as a working hypothesis 
that the effect demonstrated represents some- 
thing stable in patient-therapist relationships. 
Confirmation of this hypothesis could make 
possible the development of logical, objective, 
actuarially sound procedures for pairing 
patients with therapists, an exciting idea. 
The results give strong support to the views 
of interpersonally oriented psychotherapists 
who emphasize the importance of the nature 
of the interaction between patient and ther- 
apist as persons in determining the process 
and outcome of the therapeutic relationship. 
Fromm-Reichmann (1949), for example, has 
been very explicit on this point, suggesting 
that therapists should refrain from working 
with individuals whose personalities are 


markedly out of tune with their own. The 
problems have come in attempting to specify 
in advance with precision who should and 
who should not get together in a psycho- 
therapeutic relationship. The identification of 
which 


a measurable interactive dimension 
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may be related to therapeutic outcome in a 
specifiable way (i.e., global personality simi- 
larity) could be a step toward achieving what 
is needed in this regard. 

Turning to the question of what these re- 
sults mean in terms of the therapeutic process, 
it is perhaps not surprising, in view of the 
rather gross nature of our approach, that 
not a great deal has been really clarified. 
The reasoning underlying our hypothesis was 
admittedly vague and inelegant—essentially 
we assumed that excessive similarity might 
make the therapist ineffective by virtue of 
overidentification with the patient and his 
problems, and that excessive dissimilarity 
might have the same result by making it im- 
possible for him to understand or appreciate 
what the patient was trying to say to him. 
Fiedler (1958) explains his findings on task 
group leadership in somewhat similar terms. 
While the range of possible explanatory 
constructs for the phenomenon demonstrated 
is very broad, we suspect that in essence most 
of them will reduce to something very close 
to this rather simple formulation. At present 
there appears to be little better to offer. 

The conceptual difficulties encountered in 
attempting to come to grips with the result of 
this study stem in part from its inherent limi- 
tations, in particular our enforced reliance on 
a global index of similarity, and in part from 
the not unrelated problem of having very 
little knowledge upon which to base our 
thinking about what are the important di- 
mensions of the therapeutic relationship. We 
hope and expect that further research will 
resolve these issues. 


SUMMARY 


An index of personality similarity for 
patient-therapist pairs was derived from 
scores made by a group of 60 patients and 
their therapists on the MMPI, and an 
attempt was made to relate similarity to 
success in therapy, as measured by super- 


visor’s ratings. Success was found to vary 

significantly with similarity, the form of the 

relationship being curvilinear, in accord with 
the investigators’ hypothesis that either ex- 
treme similarity or extreme dissimilarity 
would impede the therapeutic process. The 
results were discussed in terms of their 
generality, and their practical and theoretical 
implications. 
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According to Freudian theory (1938) there 
is a symbolic association between the male 
and female genitalia and certain classes of 
objects which are similar to them in shape. 
The experimental investigation of the Freud- 
ian hypothesis was initiated by Levy (1954), 
and has been extended by Starer (1955), 
Jones (1956), Winter and Prescott (1957), 
and Moos and Mussen (1959). Although 
Levy, using children as subjects, did not find 
support for the Freudian hypothesis, positive 
results were reported by later investigators 
using adult subjects. Jones found that males 
identified more symbols in a manner con- 
sistent with the Freudian hypothesis than did 
females. In contrast to the findings of Levy 
and Jones, most psychoanalytic writers (e.g., 
Erikson, 1950; Fenichel, 1945) believe that 
children of both sexes are aware of biological 
sex differences as early as three or four years 
of age. To this writer’s knowledge, neither 
Jones’s nor Levy’s results have been 
replicated. 

The study by Levy, and those which fol- 
lowed, investigated the association suggested 
by Freud between the shape of the symbol 
and its purported referent, the sexual organ. 
It is suggested by this author that a relatively 
shapeless stimulus, such as a texture, may also 
be adequate to symbolize some aspects of 
masculinity and femininity. Rabin (1959) 
found that the “mother” and “father” cards 
on the Rorschach were perceived differently 
in terms of their textures, i.e., Card IV was 
perceived as heavy and rough, whereas Card 
VII was perceived as light and smooth. 

The present writer sought to explore both 
the symbolic significance of structured 
(shaped) symbols, with special reference to 
age and sex differences, and to introduce an 
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to T. Allen, G. Anderson, H. H. Anderson, D. G. 
Brown, A. I. Rabin, and S. Siegel for their helpful 
criticisms of the study, and to the public schools of 
East Lansing, Michigan for their cooperation in 
making subjects and facilities available. 
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unstructured stimulus, a texture, as another 
possible mode of symbolic expression of 
masculinity and femininity. Specifically the 
study was conducted to answer the following 
questions: 

Are structured symbols sorted as pre- 
dicted by the Freudian hypothesis? Is the 
response to the structured symbols affected 
by the age and sex of the respondent? 

2. Can textures be used to symbolize mas- 
culinity and femininity, i.e., are the dark 
and rough textures more often identified as 
masculine, and are the light and soft textures 
more often identified as feminine than would 
be expected by chance? Are there age and 
sex differences in the responses to the texture 
symbols? 

MeETHOD 


Sample. The sample was composed of 120 sub- 


jects: 20 male, and 20 female subjects at the 
fourth-, ninth-, and college grade levels with 
mean ages of 9-4, 14-4, and 20-2, respectively 


Although the subjects were not individually matched 
for intelligence and socioeconomic background, the 
three groups were believed to be similar in these 
respects. The grade school and high school popu- 
lations from which the subjects were drawn were 


highly saturated with the children of university 
faculty and students. 
Materials. Twenty structured and 20 unstruc- 


tured (texture) symbols were used. There were 10 
male and 10 female symbols of each type.2 The 
structured symbols were drawn on 3 X 5 cards and 
the texture symbols on 3X3 cards. The symbols 
were selected from a larger group which was sub- 
mitted to five judges well versed in psychoanalytic 
theory. A symbol was included in the final set if at 
least four out of five judges agreed upon its sym- 
bolic significance, and if it added to the overall va- 
riety of the symbol set. The structured symbols pre- 
sented to the judges included those used by Levy 
(1954) and Starer (1955), as well as other symbols 
created by the writer so that the figures could not 
be sorted on the basis of sharpness vs. roundness. 
The final group of structured symbols included one 
symbol used by Levy and five of those used by 
Starer. The texture symbols presented to the judges 
were created by the writer and varied generally on 
the dark-light and rough-smooth continua. 


2 Copies of the symbols are available from the 
author upon request. 
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Procedure. Each subject was seen individually by 
the author and was instructed to sort cards into 
one of two piles as follows: 


I have some cards with figures or pictures on 
them. I would like you to place each card in the 
pile to which you think it belongs. If it reminds 
you more of a father, brother, boy, or man, place 
it in this (pointing) pile. If it reminds you more 
of a mother, woman, girl, or sister, place it in this 
pile. They will not look like anything, so you will 
have to guess. Please turn the cards over when you 
put them down. 

Two labels were placed in front of the subject. 
On one label was printed “father, brother, boy, 
man,” and on the other, “mother, sister, girl, 
woman.” A small achromatic picture of a man 
and boy and one of a girl and woman were placed 
above the labels at the head of the corresponding 
piles for the fourth-grade subjects. The order of 
presentation of the structured and texture cards, 
and the side on which the male and female labels 
were placed, were randomized. The pile in which 
the cards were sorted was recorded by the experi 
menter while the subject was sorting. The score for 
each subject was the number of cards sorted in the 
direction designated “correct” by the judges. 


RESULTS 
Structured Symbols 


1. The ¢ test was used to compare the 
number of structured symbols sorted in agree- 
ment with the judges’ ratings with chance 
expectancies. For all age-sex groups a greater 
number of structured symbols was correctly 
identified than would be expected by chance 
at less than the .01 
(Table 1). 

2. An analysis of variance was used to test 
for age and sex differences (Table 2). No 
significant differences were found between 
age groups in their identification of struc- 
tured symbols. 


level of significance 


TABLE 1 


VALUES oF ? RESULTING FROM THE COMPARISON OF 
THE NUMBER OF STRUCTURED SYMBOLS IDENTIFIED 
CORRECTLY WITH CHANCE EXPECTANCY 


Grade Fourth Ninth 


College 
Males 
Females 
Both sexes 


5.67*** 


4.16*** 


6.29%** 
5.80*** 


8.66*** 


3.09** 


Note.—-n = 20 for each subgroup. 
*p < 01 


< 


UNSTRUCTURED 


TABLE 2 


STRUCTURED SYMBOLS 


Correct responses 
(n = 20) 


Male 


Female 


SD 
2.21 
2.47 
2.62 


Grade ] S M 


Fourth 
Ninth 
College 


12.65 
12.3 
13.4 


Analysis by age and sex 


(N = 120) 


Source 


Sex 

Age 

Age X sex 
Error 


3. Reference to Table 2 indicates that male 
and female subjects did not differ signifi- 
cantly in their identification of structured 
symbols. 

4. The significance of the subjects’ agree- 
ment on the sorting of each structured sym- 
bol was tested by use of the binomial distri- 
bution. The results of these tests are pre- 
sented in Table 3. 


Texture Symbols 


1. When the average number of texture 
symbols identified correctly by individuals 
within each age-sex group was compared 
with chance expectancies by the use of the 
t test, it was found that the ninth-grade and 
college groups identified significantly more 
texture symbols correctly than would be 
expected by chance. The responses of the 
fourth-grade subjects did not differ from 
chance except as a result of a reduction in 
the standard error of the mean in the com- 
putation over the total fourth-grade sample 
(Table 4). 

2. An analysis of variance was used to test 
for age and sex differences (Table 5). Age 
differences were significant at less than the 
.O5 level. Individual comparisons were made 
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TABLE 3 


BINOMIAL PROBABILITY OF AGREEMENT ON THE SORTING OF 
STRUCTURED AND UNSTRUCTURED SYMBOLS 


Structured symbols 
(n = 40 for each grade level) 


Symbol Fourth Ninth College 


NOUS 


.05 in opposition to judges’ ratings 


using the studentized range distribution 
(Dixon & Massey, 1957). None of these com- 
parisons reached statistical significance. 

3. Significantly more females than males 
identified the texture symbols in agreement 
with the judges (Table 5). 

4. The subjects’ agreement on the sorting 
of each texture symbol was tested by the 
binomial distribution. The results are pre- 
sented in Table 3. 


TABLE 4 


VALUES OF ¢ RESULTING FROM THE COMPARISON OF 
THE NUMBER OF TEXTURE SYMBOLS IDENTIFIED 
CORRECTLY WITH CHANCE EXPECTANCY 


Fourth 


Grade Ninth College 


Males 
Females 
Both sexes 


1.28 
1.89 
2.29* 


4.43*** 
6.83*** 


7.79*** 


2.63* 
4.45*** 
4.98*** 


Note.—n = 20 for each subgroup. 


05. 
< 


Unstructured symbols 
(n = 40 for each grade level) 


Ninth 


Symbol Fourth College 


21 
22 
23 
24 
25 

26 

27 


28 


DISCUSSION 


The present data are consistent with the 
results of earlier studies by Starer (1955), 
Jones (1956), Winter and Prescott (1957), 
and Moos and Mussen (1959), which sup- 
port predictions from Freudian theory that 
elongated and pointed objects would be 
sorted into a male category, and rounded or 
containing objects into a female category. 
In the present study the subjects of both 
sexes in all age groups identified the struc- 
tured symbols in a manner consistent with 
the Freudian hypothesis. The present data 
are also consonant with the psychoanalytic 
observations of Fenichel (1945), and studies 
by Conn and Kanner (1947), and Brown 
(1958), which indicated that anatomical sex 
differences are recognized by boys and girls 
as early as ages four or five. 

In contrast to the present results, Levy’s 
(1954) fifth-grade subjects did not identify 
his structured symbols better than chance. 


46 
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Three differences in procedure may account 
for the conflicting results, although these 
same procedural differences do not seem to 
be important with adults (Stennett & Thur- 
low, 1958). Levy’s subjects matched the 
symbols to names rather than to nouns at 
an age when it is found that names are not 
often used to distinguish boys from girls 
(Conn & ‘Kanner, 1947). Levy also employed 
a group procedure which may have caused 
difficulty in the detection of those children 
who, from poor motivation or lack of under- 
standing, paired the names and symbols 
randomly. Finally, the population sampled in 
the present study may differ from that of 
Levy’s in socioeconomic background and in- 
tellectual level. 

Jones (1956) found that males identify 
more sexual symbols correctly than do 
females. Jones’s findings may have resulted 
from an emphasis upon the sexual significance 
of the symbols which seemed to be contained 
in his instructions. Adult males may more 
readily respond to “sexual” symbols than 
adult females in the presence of a male 
experimenter. In the present investigation, in 
which no sex differences were found, the 
sexual nature of the symbols was not stressed, 
nor seemingly recognized. 


TABLE 5 


TEXTURE SYMBOLS 


Correct responses 


Female 


Grade 


Fourth 10.9 
Ninth 12.6 
College 12. 


3.14 
2.62 
3.40 


Analysis by age and sex 


(N = 120) 


Source df 


Sex 

Age 

Age X sex 
Error 


*p <.05. 


As opposed to the responses to the struc- 
tured symbols, the adolescent and adult sub- 
jects identified more unstructured symbols 
correctly than the prepubescent subjects, and 
the female subjects sorted more unstructured 
symbols correctly than the male subjects. The 
discrepancy between the data from the two 
types of symbols may be understood by 
assuming that each group of symbols refers 
to different aspects of sexuality. The essential 
bond between the symbol and the symbolized 
in the Freudian hypothesis is shape. The 
textures used, however, are relatively shape- 
less. This author assumes that the textures 
symbolize that aspect of masculinity or 
femininity which is culturally rather than 
anatomically defined, i.e., the cultural sex- 
role. 

In the present study the subjects of all age 
levels identified the all black card (S-29; 
symbol numbers refer to Table 3) as male, 
and a light grey (S-38), and a soft textured 
card (S-32) as female. The subtle variations 
in texture were missed by the fourth-grade 
subjects which resulted in a_ statistically 
significant age difference in the number of 
texture symbols correctly identified. If the 
initial assumption is granted, one could con- 
clude that fourth-grade children are, at 
some level, aware of the grosser aspects of 
cultural sex-roles, as indicated by Rabban 
(1950) and Brown (1958), but not of the 
subtleties which seem to be learned later. 

The ninth-grade and college subjects sorted 
four texture symbols which were quite dark, 
rough, and blotchy as masculine (S-21, 22, 
23, 29) and six texture symbols (S-32, 34, 
35, 36, 38, 39) which were generally light 
in tone, airy, fluffy, and somewhat fragile in 
appearance as feminine. The symbols which 
were not consistently sorted as either mascu- 
line or feminine (e.g., S-31, 33, 40) appeared 
to have the characteristics mentioned above 
to a lesser extent or in some mixture. 

Female subjects sorted more texture sym- 
bols correctly than male subjects and the dif- 
ference tended to increase with age. These re- 
sults suggest that the girls in the present sam- 
ple were more acutely aware of the cultural 
sex-role stereotype than were the boys. In 
contrast, Lynn (1959) postulates that a 
young girl is somewhat confused by the lati- 
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tude allowed her in her sex-role behavior, 
and that a boy’s sex-role is quite clearly de- 
fined. The apparent conflict between the views 
may be a function of the present writer’s 
sample, i.e., more latitude in sex-appropriate 
behavior may be allowed the male in a uni- 
versity setting than in the modal population 
which may be reflected in the sorting of tex- 
ture symbols. 

The author has assumed that the textures 
used in the present study symbolize cultural 
sex-role expectancies, whereas the structured 
symbols refer only to anatomical sexuality. 
Post hoc examination of the sorting of each 
structured symbol compels one to reconsider 
the assumption of the present and earlier 
writers that structured symbols have the 
genitals as their sole referent. Subjects of all 
ages agreed that elongated symbols should be 
sorted in the male pile although elongated- 
ness alone was not sufficient. The pointedness 
of the symbols seemed to be neither a neces- 
sary nor a sufficient element. The three sym- 
bols (S-15, 16, 17) which were sorted con- 
sistently as female by all age groups were 
curved in shape, but not all curved figures 
were similarly sorted. 

It was informally observed that many sym- 
bols seemed to provide two sets of cues, one 
set referring to anatomical sexuality, and the 
other to culturally defined sexuality. The sex- 
role cues sometimes were consonant with the 
anatomical cues, in which case the symbol 
was sorted consistently as either male or fe- 
male, e.g., an elongated and pointed symbol 
(S-2), seen by many as a glider, was sorted 
consistently into the male category. Other 
symbols, which were sorted irregularly, con- 
tained conflicting sex-role and shape cues 
(S-10, 18, 19, 20), e.g., one symbol (S-20) 
composed of three concentric circles was 
rated by judges as female, probably because 
of its roundness, but many subjects sorted 
this symbol into the male pile while infor- 
mally commenting on its resemblance to a 
target. It is tentatively suggested on the ba- 
sis of the former observations that when the 
anatomical and cultural cues of a structured 
symbol are consonant, the sorting of the sym- 
bol will be consistent. If these cues are dis- 
sonant, the response to it will be confused. 
There was some indication that the same con- 
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fusion was contained in some of the texture 
symbols. For example, one dark texture sym- 
bol (S-26) composed of an entanglement of 
curved lines was rated as masculine by the 
judges, but the ninth-grade subjects agreed 
it should be sorted as feminine. 

Although both the unstructured and struc- 
tured symbols have been discussed as if the 
sex-role and Freudian referents could be iso- 
lated, a cultural-biological symbol dichotomy 
probably does not exist. Culturally, sex-role 
prescriptions are sometimes accepted as real 
biological differences (Newcomb, 1950), and 
psychologically, the distinction appears more 
theoretical than real (Colley, 1959). A dy- 
namic interaction between the cultural and 
anatomical elements of a symbol might be 
postulated from a psychoanalytic point of 
view. For example, the conscious recognition 
of the genital referent of a structured symbol 
may be defended against by responding to its 
consonant sex-role cues, its dissonant sex-role 
cues, or by denying that it has any meaning. 
The defensive reaction to a symbol is not re- 
corded by a nomothetic approach which tabu- 
lates only “correct” responses. The Freudian 
hypothesis is thus tested without sufficient 
regard for Freudian theory. 

A further criticism of the present and ear- 
lier studies involves the communality in the 
cultural background of the judges and sub- 
jects. The lack of judge-subject independ- 
ence would spuriously increase the number of 
symbols correctly identified on the basis of 
common cultural cues rather than because of 
the subjects’ recognition of the anatomical 
meaning of the symbols. The between-subject 
agreement on the sorting of a symbol indi- 
cates only that the symbol had some com- 
mon meaning, but does not specify if the 
meaning is cultural or anatomical.* 

The interpretations of the results which 
have been presented are untested and are 
offered for their heuristic value. If it were 
established that “universal symbols” had cul- 

3 The studies of Barker (1957), and Schonbar and 
Davitz (1960) have come to the attention of the 
author after submission of this manuscript. Their 
results indicate that both adult and child subjects 
sort symbols with obvious cultural significance ac- 
cording to the cultural cues, even if these cues are 


at variance with the anatomical value of the 
symbols. 
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tural components, then more attention to the 
cultural or subcultural significance of the 
symbols would be in order in their interpreta- 
tion. An awareness of the possible multiple 
dimensions of a symbol (e.g., shading and 
shape), and multiple determination of the 
response to a symbol (cultural, anatomical, 
and psychological) would allow greater inter- 
pretive flexibility. 


SUMMARY 


Structured and unstructured sexual sym- 
bols were administered to 40 fourth-grade, 
40 ninth-grade, and 40 college subjects. The 
structured symbols were sorted in agreement 
with the judges’ ratings at all age levels and 
by both sexes. The unstructured symbols 
were identified in a manner consistent with 
judges’ ratings by the adolescent and college 
groups, but not by the preadolescent group. 
A sex difference was found indicating that 
female subjects identified more unstructured 
symbols consistent with the judges’ ratings 
than male subjects. A discussion was pre- 
sented concerning the possible multiple de- 
termination of the responses to sexual sym- 
bols, the multiple dimensions of the symbols 
themselves, and the relation of the results to 
the Freudian hypothesis concerning sexual 
symbolism. 
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Assessment of the learning capabilities of 
presumably retarded children is a difficult 
operation. As Masland, Sarason, and Gladwin 
(1958), Clarke and Clarke (1958), and 
Haggard (1954) have shown, instruments 
such as the Stanford-Binet and the Wechsler- 
Bellevue may prove inadequate. The range of 
what these tests measure is relatively re- 
stricted to academic and verbal criteria. They 
tend to ignore cultural, motivational, and 
situational differences crucial to test per- 
formance. The most widely used tests also 
ignore the context within which particular 
functioning abilities are relevant. 

In this report, we want to emphasize the 
pertinence in many contexts, and certainly 
the pertinence within institutions, of assessing 
specific functioning abilities as distinguished 
from measures of general intelligence. Follow- 
ing Baldwin (McClelland, Baldwin, Bronfen- 
brenner, & Stodtbeck, 1958), we have defined 
ability as “a characteristic of the person 
which permits him to behave adaptively.” As 
Baldwin remarks, in trying to describe spe- 
cific abilities, we are trying to identify sys- 
tems of adaptive behavior. Moreover, ability 
cannot be considered to be unidimensional, as 
there may be a variety of ways of accom- 
plishing the same result. 

In the case of institutionalized retarded 
children, tests of functioning abilities should 
allow us to estimate the individual child’s 
present level of ability to solve abstract prob- 
lems, to comprehend the communications of 
others and to express himself, and to function 
in social situations. Measurement of relative 

1 This research was supported in part by United 
States Public Health Service Grant Number OM-111, 
and in part by a grant from the University of 
Kansas. The authors acknowledge the helpful read- 
ing and suggestions of R. L. Schiefelbusch, E. Gor- 
don Collister, Margaret C. Byrne, Harris Winitz, and 
Joseph Spradlin. 

2 Now at Dartmouth College. 


levels of ability in these areas has greater 
practical utility than measurement of the 
extent to which general intelligence deviates 
from noninstitutional norms. 

This report presents findings obtained in 
an effort to assess the value of one measure 
of intelligence, the Porteus Maze Test, as a 
predictor of three types of functioning abili- 
ties known to be pertinent to life in an insti- 
tution: namely, language ability, perception 
of physical surroundings, and social maturity. 

As Masland, Sarason, and Gladwin (1958) 
suggest, the Porteus Maze Test seems from 
past research to predict such abstract quali- 
tative attributes as alertness, foresight, and 
ability to plan. It is a test that children can 
feel comfortable working with. The subjects 
in the cottage group studied in this project, 
boys ranging in age from 6 to 12, had with 
few exceptions tried and failed to remain in 
public schools. All had been subjected to a 
barrage of clinical and academic tests prior 
to entry into the institution and had been 
found lacking. Our concern was to apply 
tests that promised the least likelihood of 
tapping reservoirs of disposition to defeat or 
failure. 

The Porteus contains no built-in indicators 
of language ability, perceptual awareness of 
surroundings, or social maturity. Unlike the 
intelligence tests most commonly used on re- 
tarded children, the Porteus is nonverbal and 
abstract. Of course, verbal instructions are 
involved, but they are extremely minimal. 
By abstract, we mean that if the Porteus 
measures qualities relevant to practical func- 
tioning, it does so only indirectly. 

We defined the characteristic measured 
by the Porteus as the individual level of 
ability to solve abstract problems denoted by 
a maze of lines on pieces of paper which 
were not intentionally representative of or 
analogous to the other types of behavior 
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being assessed. Mental age (MA) on the 
Porteus was thus viewed as an index of an 
individual’s ability to solve an abstract intel- 
lectual problem. 

The task was to determine whether MA on 
the Porteus was predictive of three types of 
functioning ability and to assess at least one 
aspect of the reliability of the Porteus test 
as well. It is clear that the answer to the 
question of whether the Porteus is valid with 
regard to content and construct will depend 
upon the psychological insights of qualified 
and expert judgment. But the ultimate, the 
basic, value of the Porteus is its “functional” 
validity. The question here is, how well does 
it work in situations for which it is intended? 
How well does it select, differentiate or pre- 
dict? Before considering the predictive power 
of the Porteus, we shall describe our measures 
of the three functioning abilities. 


MEASURES OF ABILITIES 
Environmental Map Tests 


Two experimental map tests were developed to 
estimate accuracy of immediate surround. The first 
consisted of a two dimensional map of the floor of 
the institutional cottage in which the 29 subjects 
were housed. Its cartographic features were simple. 
An architectural floor plan was drawn, but com- 
plex features were removed. Lines for windows, 
doors, and the like, were removed. Some elements 
of the cottage were emphasized by representational 
overlay, such as the television set in the lounge, the 
aide’s desk, the front door, and two bathrooms, and 
a few beds. 

Each subject was presented with this Cottage 
Map and was asked to locate in the following order: 
entry, aide’s area, recreation room, lounge or tele- 
vision room, bathroom, and his own bedroom. 
Finally, the subject was asked, “What room do you 
like most?” 3 

A second map, the Grounds Map, was a simplified 
two dimensional drawing of the institutional plant. 
An arrow was inset to identify the home cottage. 
Paths were drawn between those five buildings the 
subjects were observed to utilize most frequently. 
The buildings were emphasized by three dimensional 
insets and by enlarging their size relative to the 
scale of the map. The procedure was to have the 
subjects indicate the path to each building rather 
than to identify the building by its appearance. 
Subjects were asked how they would get to each of 
the following structures: dining room, gymnasium, 
hospital, school, and administration building. 

The two map tests were significantly intercorre- 
lated (r = .70, p < .01), permitting combination. 


3 About two thirds chose their own bedrooms. 


The Language Test 


Ability to comprehend the communication of 
others and ability to verbalize are highly pertinent 
functioning abilities that require close assessment in 
the case of retarded children. The Parsons Language 
Sample, (Spradlin, 1960) was developed for this 
purpose. Using concepts of verbal behavior proposed 
by Skinner, Spradlin* devised a series of seven 
subtests based on 121 items that constitute tasks 
presented to subjects as follows. 

The first subtest, Tact, requires the subject to 
name a series of objects and pictures. The examiner 
holds the object or picture in view of the child 
and asks, “What is it?” An appropriate vocal re- 
sponse is recorded for each item if the subject gives 
one of a number of previously specified responses. 
Items in the second subtest, Echoic, require the sub- 
ject to repeat a series of words, phrases, or sen- 
tences, an appropriate response being a complete 
repetition of the vocal stimulus. The Intraverbal sub- 
test requires the subject to complete sentences. An 
appropriate vocal response is determined by the 
part of the sentence presented by the examiner. An 
example is: “The flag is red, white and —————.” 

Subtest four, Comprehension, requires the subject 
to follow simple directions. Some directions are 
given with speech only, some directions are given 
with gestures only, and some are given with a 
combination of speech and gesture. All items in the 
subtest require the child to perform some specific 
motor act. Echoic Gestures require that the subject 
mimic a series of motor acts first demonstrated by 
the examiner. 

In the sixth subtest, Intraverbal Gesture, the sub- 
ject is asked questions about the function of objects. 
This subtest is so designed that both vocal and 
gestural responses are highly probable. Neither a 
vocal nor gestural response precludes the possibility 
of use of the other. The gestures elicited by these 
items are usually specific in nature and tend to 
demonstrate the function of the object. For example, 
if the child is asked, “What do you do with a 
key ?” he often makes a motion with his hand as if 
he were twisting a key to unlock the door. 

The Mand subtest consists of a series of seven 
items which are designed to elicit demands, requests, 
or questions from the subject. As in the Intraverbal 
Gesture subtest, an appropriate response may be a 
gesture and/or a vocal response. The subtests are 
classified as either vocal or nonvocal in the tables. 
The three subtests, Tact, Echoic, and Intraverbal, 
demand vocal responses exclusively. Thus the single 
measure obtained on each of the three tests is 
classified as vocal. The Echoic Gesture and Compre- 
hension subtests demand nonvocal responses exclu- 
sively and the measure obtained on each subtest is 
classified as nonvocal. Both the Mand and Intra- 


4 The authors are grateful for the opportunity ex- 
tended by Joseph Spradlin to employ his Language 
Sample and for his assistance in supervising the 
testing. 


51 
™ 
} 4 
1 
4 
~ 
j 
2 
ae 
= 
pe 


52 RosBert A. DENTLER AND BERNARD MACKLER 


verbal Gesture subtests yield vocal and nonvocal 
scores. 

For 40 retarded children in the same institution, 
the average test-retest reliability for the subtests 
was r, average = .90 (Spearman). To assess con- 
current validity, Spradlin (1960) obtained aide 
rankings of retarded children on criteria of vocal 
and nonvocal communication ability. Rankings on 
vocal ability for 171 subjects were correlated with 
rankings on the pertinent portions of the Language 
Sample. The r, average for nine groups was .61. 
The rank correlation for 160 subjects on the non- 
vocal subtests with aide rankings, 7, average for 
10 groups was .60. 


The Social Maturity Test 


Doll’s Vineland Social Maturity Scale (1953) is a 
serviceable instrument for appraising an individual’s 
social competence. Its emphasis is upon ability to 
perform independently rather than ability to ad- 
just socially to other persons. In terms of socializa- 
tion, it bears most directly upon independence 
training. 

To assess this type of social maturity more di- 
rectly, the authors devised a laboratory test based 
on items like those used in the Vineland interview 
schedule.5 Direct assessment was deemed important 
in general for more reliable measurement and in 
particular for assessment of children entering an 
institution, whose parents or guardians would be 
responding to the Vineland schedule under volatile 
conditions. 

A pool of 26 items ranging in difficulty from 
ability to unwrap candy to ability to write name 
with a pencil, each modeled upon Vineland items, 
was cast into an inventory. Subjects were tested 
individually in a laboratory under conditions that 
simulated semistructured play therapy. The tester 
checked success or failure on each item on the 
schedule as it was attempted by the child. Test- 
retest reliability for the 26 items was 7 = .98, with a 
2-week interval between tests. 

To improve the internal consistency of the instru- 
ment, the 11 items from the pool of 26 which 


5The authors are grateful for the assistance ex- 
tended by Gerald Siegel in the creation and admin- 
istration of this test. 


reflected test-item intercorrelations ranging from 
.70 to 30 were selected to form the index. 

This index of social maturity correlated .90 with 
standard scores from the Vineland test administered 
earlier by institution staff members to subject’s 
parents. 


VALIDATION OF ABILITY MEASURES 


As stated earlier in the paper, ability was 
defined in terms of adaptive characteristics. 
The investigators undertook correlational 
analysis (Pearson) with the four tests, 
Porteus, Maps, Language, and the Social 
Maturity, and measures of behavior and 
peer and aide social status (Dentler & Mack- 
ler, 1960). 

Measures of behavior consisted of 3 weeks 
observation and recordings of same by the 
investigators. These records were transcribed 
by a trained recorder and acts initiated, both 
friendly and unfriendly, were noted. 

To assess status among peers, each child’s 
photograph was mounted in a portable board. 
The pictures were randomly assigned in six 
rows of five columns. Each child was seen 
individually and the questions about play, 
work, and identification were asked. 

Aides responded to a questionnaire that 
consisted of questions parallel to the chil- 
dren’s including aide preferences for play 
and work, and those children who responded 
to cottage rules and regulations. 

As an indication of the association between 
the ability tests and behavior and sociometric 
status, some of the significant correlations are 
given. For the same 29 boys, Maturity vs. 
Initiates Friendly and Unfriendly Interaction 
= .55 (p< .01). 

All four tests were significantly associated 
with sociometric peer choice status, Porteus = 
4S (p< .01), Maps = .31 (p< .05), Lan- 


TABLE 1 


MEAN, STANDARD DEVIATION, AND RANGE OF SCORES ON MEASURES OF 
INTELLIGENCE AND FUNCTIONING ABILITIES 


MA 
Porteus 


Scores CA 


Mean 9.61 
Standard deviation 1.91 
Range 6.67 to 12.75 
Maximum possible range 


5.46 
0 to 15.5 


MA Porteus 


revised Maps Language Maturity 


4.38 
3.50 
0 to 10 


5.03 
2.04 
2to9 
0to9 


5.07 
5.24 
—1l to +11 


0 to 121 —11to +11 
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TABLE 2 


REGRESSION COFFFICIENTS OF FUNCTIONING ABILITIES 
(y) oN Porteus Maze MA 


Functioning 
abilities 


Porteus 
Porteus revised 


Map Test 
Language ‘Test 
Maturity Index 


6 
75** 


** » < .00S. 


guage = .50 (p< .01), and Maturity = .48 
(p < 01). 

Correlation of ability tests with aide re- 
sponses revealed the following significant 
associations: aide play preferences and 
Maps = 41, (p< .01), and Social Matu- 
rity = .48 (p< .01); conformity with aide 
regulations and Porteus = .29 (p < .05) and 
Language = .32 (p< .05). The above cor- 
relations are cited as indicative of the “ex- 
ternal” validity of the ability measures. 

PROCEDURE 

The four tests were administered to the 29 mem- 
bers of one cottage group. No child had been insti- 
tutionalized prior to entry, and at the time oi 
testing no subject had resided in the cottage longer 
than 2 months or less than 1. The children had 
been admitted to the institution and tentatively 
diagnosed as educable retardates. None had been 
placed on drugs prior to testing. 

The first examiner had engaged in nonparticipant 
observation in the cottage for 3 weeks prior to 
testing and was a familiar figure to all the children. 
He administered the Porteus and the Map tests in 
one of the cottage bedrooms, the subjects having 
been invited casually to participate. The Cottage 
Map was administered first, each child being asked 
to point to the correct locations on the map. The 
Grounds Map was presented next, subjects being 
asked to point to the paths they would take to get 
to each of the five buildings. Finally, the Porteus 
Mazes were administered, following procedures speci- 
fied in the Porteus manual (1959). 

The Porteus Revision was administered by staff 
psychologists in the Psychological Division of the 
Institution, within one month after the first ad- 
ministration described above. The Language Test 
and the Maturity Index were administered by staff 
members of the Parsons Hospital Research Unit 
within the unit’s laboratories. Examiners for both 
these tests were not familiar with the children. 


RESULTS 
Table 1 reports means, standard deviations, 
and ranges of scores for the four tests of 


abilities. Lacking bases for conversion to 
standard scores on three tests, the data were 
transformed into T scale equivalents for pur- 
poses of correlation. 

The correlation between the Porteus and 
the Porteus Revision was r= .85. Several 
considerations lead us to interpret this as evi- 
dence of satisfactory  stability-reliability. 
First, this was not a simple retest. Different 
mazes, different testers, and different testing 
conditions characterized the procedure. More- 
over, some subjects who were able to solve 
maze problems on the first administration, 
under a tester who was very familiar with 
them and in the unthreatening setting of a 
cottage bedroom, were unable to perform at 
all on the administration of the second test, 
given by unfamiliar testers. 

Table 2 indicates that the second, revised, 
test serves about as well as the first as a 
predictor of the three functioning abilities.® 
The regression coefficients support the in- 
terpretation that the Porteus Maze Test is 
an adequate predictor of three pertinent func- 
tioning abilities, none of which is a priori a 
correlate of abstract problem solving ability. 
Chronological age was of course positively 
associated with MA. The rather large CA 
range of from 6-8 years to 12—9 years affected 
the correlations. When CA was held con- 
stant, however, each regression coefficient 
remained significant at the .025 level or 
better (Table 3). 

The intracorrelation of tests by subjects is 
also of interest. The subjects were ranked 
on the Porteus, Maps, Language, and Social 


6 Since T scale equivalents were used, incidentally, 
the regression coefficients are exactly equal to cor- 
relation coefficients between the variables. Cf., Allen 
E. Edwards (1954). 


TABLE 3 


INTERCORRELATIONS (r) OF FUNCTIONING ABILITIES 
WHEN CHRONOLOGICAL AGE 1s HELD CONSTANT 


Functioning 
abilities 


Porteus 
Porteus revised 


Map Test 53* 


Language Test A8* 


Maturity Index .58* 
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TABLE 4 


DISCREPANCIES BETWEEN RANKED SCORES 


Number cases showing discrepancy between ranks with Porteus 


Below —15 —15to —11 


—10to-—6 —Sto+5 11to15 


Maps 0 1 
Social Maturity 0 3 
Language 1 1 


Maturity tests, and Kendall’s coefficient of 
concordance was calculated. For the four sets 
of scores, W = .80, (p < .001) which indi- 
cates a very high degree of concordance 
among the scores taken by subject. 

Table 4 displays in more concrete fashion 
the trend of discrepancies between the 
Porteus and the three tests of functioning 
abilities. On the average, two thirds of the 
subjects reflected identical or virtually identi- 
cal ranks. Discrepancies in ability scores 
were mot associated with differences in CA. 

As a further check, the examiners attempted 
to answer another question. Does the general 
mental level of the child influence the 
amount of intertest discrepancies? Analysis 
of discrepancies by mental level on the 
Porteus revealed a complete lack of effect 
from mental level. That is, discrepancies do 
not vary in magnitude from MA level to MA 
level. 


DIscussION 


The Porteus is a fairly stable and adequate 
predictor of language ability, awareness of 
environmental surroundings, and social matu- 
rity. This simple, brief, test designed to 
measure abstract problem solving ability is 
useful in making environmentally pertinent 
distinctions between retarded children. The 
broad utility of the Porteus Maze Test is 
once again confirmed. 

Investigating the discrepancy between 
scores (Table 4) more closely, we found 11 
scores disagreed by at least 11 ranks with 
rank on the Porteus. Seven of the 29 chil- 
dren accounted for these discrepancies, how- 
ever. Three of the seven reflected discrep- 
ancies between Porteus and two of the three 
functioning abilities. 

In order to consider discrepancies ana- 


5 17 
1 18 
2 19 


lytically we viewed each individual’s score 
on the Porteus as an anchor point. Among the 
seven individuals with notable differences be- 
tween their ranks on the Porteus and their 
ranks on the other tests, we found two crude 
types: subjects whose language, environ- 
mental, and social maturity performances 
were below, and subjects whose performances 
were above, the rank obtained on the Porteus. 
Discrepancies may serve as crude profiles. 
The baseline for profile analysis here is not 
an “outside” population, of course, but the 
child’s peers within the institution. Ideally, 
this baseline would include a large sample of 
all currently institutionalized children at a 
particular institution. 

To illustrate this use of discrepancies, con- 
sider one of the boys in this sample who 
appears to be functioning below his Porteus 
MA level. He ranked third on MA but 
fifteenth on the Maps and fifteenth on Lan- 
guage. After his third month in the institu- 
tion, this boy was diagnosed as capable of 
effective performance in special education 
classes in public schools and was released 
from the institution. 

Properly considered, each test assessed the 
ability of each child to perform selected tasks 
under very special social conditions. The 
examiners were adults and the tests was ad- 
ministered under the limits of a two-person 
interaction situation. A more valid assess- 
ment of functioning abilities would include 
samples of behavior among peers and in 
more complex or variegated social settings. By 
viewing the Porteus Maze Test as a test of a 
particular kind of functioning and by relating 
it to tests of other types of behavior, two 
things were learned. For three out of four 
subjects, the Porteus Maze was an excellent 
predictor of level of functioning. For other 
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subjects, the discrepancies between the 
Porteus and the other tests were found to 
relate meaningfully to the behavior of the 
‘ children within the institutional cottage. 

The aim here has been to suggest very 
tentatively the interpretive utility of the 
evaluation of discrepancies. If a greater 


variety of measures of functioning abilities 
was used, this approach could be extended. 
Academic skills, subtests of particular lan- 
guage abilities, and motor skills, perhaps, 
could be added. The additional discrepancies 
between tests could facilitate rehabilitation 
and training planning for individual children. 


SUMMARY 


To explore the adequacy of the Porteus 
Maze Test as a predictor of three types of 
functioning abilities of retarded children, 
tests of language ability, cognitive awareness 
of environmental surrounds, and social matu- 
rity were administered to a cottage group of 
29 boys, newly admitted to a training institu- 
tion, who ranged in age from 6 to 12 years. 

The Porteus was selected as a general 
measure of learning capability because of its 
abstract, unthreatening characteristics and 
because of ease and economy of administra- 
tion. Scores from two Map tests—maze type 
sketches of the children’s new cottage and 
of the hospital grounds—were combined as 
representing the ability of the child to iden- 
tify representational symbols of environ- 
mental landmarks and the paths between 
them. 

The test of language ability was an instru- 
ment previously developed by psychologists 
employing Skinnerian categories of verbal 


behavior and tentatively established as suf- 
ficiently reliable and valid. The authors 
devised an index of social maturity by con- 
verting items from the Vineland Social 
Maturity Scale into tasks performed under 
laboratory conditions. The index was found 
to be highly stable and to associate highly 
with measures of interpersonal behavior. 

The Porteus Maze Test was found to be 
sufficiently stable and adequately predictive 
of Map, Language, and Maturity scores. 
Discrepancies between the four tests were 
explored discursively as indicators of diag- 
nostic assessment, with the suggestion that 
this approach might prove valuable for insti- 
tutional planning of individual regimens. 
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PREMORBID BEHAVIOR AND PROGNOSIS IN 
FEMALE SCHIZOPHRENIC PATIENTS’ 


AMERIGO FARINA,? NORMAN GARMEZY,? MARIAN ZALUSKY, ann JOSEPH BECKER 4 
Duke University 


Considerable research effort has been ex- 
pended in determining those characteristics 
of schizophrenic patients which have prog- 
nostic significance. This search for prognostic 
indicators has a practical purpose, since 
knowledge about the course of this illness is 
of value both to the hospital and to the pa- 
tient’s family in planning for the future. Of 
perhaps even greater value, however, is the 
contribution which such research may make 
to a better understanding of the disease proc- 
ess itself by providing cues about the relevant 
antecedent events and personality attributes 
of patients which are associated with remis- 
sion. The many difficulties encountered in 
attempts to evaluate prognostic factors have 
been discussed in detail by Blair (1940) and 
Huston and Pepernik (1958). 

For male schizophrenic patients, marriage 
is one factor consistently reported to correlate 
with recovery and discharge from the hospital. 
Counts and Devlin (1954), Jenkins and Gurel 
(1959), Lindemann, Fairweather, Stone, and 
Smith (1959), and Mason, Tarpy, Sherman, 
and Haefner (1960) all report that male mar- 
ried patients recover more rapidly than pa- 
tients who are single. This differential rate of 
recovery may indicate that the more mature 
and more adequately functioning males are 
more likely to marry and, because of their 
greater maturity, are also likely to recover 
more rapidly from a schizophrenic episode 
than single males.5 The greater social ma- 


1 This investigation was supported by a USPHS 
research grant (M-629) from the National Institute 
of Mental Health, United States Public Health 
Service. 

2 Now at the University of Connecticut. 

3 Now at the University of Minnesota. 

4 Now at the University of Illinois. 

5 The greater likelihood that married (relative to 
single) patients will leave and remain out of the 
hospital does not seem to be due to great conjugal 
family tolerance for pathological behavior. Davis, 
Freeman, and Simmons (1957) have presented evi- 
dence indicating that parental families are more 


turity of married as compared to single males 
is suggested by the fact that to marry each, 
typically, has had to make advances to a fe- 
male, court her, propose marriage and offer 
a home and economic support. The argument 
that adequacy of premorbid adjustment is 
related to recovery is supported by several 
prognostic studies employing the Phillips 
Premorbid Scale (Phillips, 1953). This scale 
quantifies premorbid adequacy in terms of 
social and sexual adjustment and marital 
status contributes to a patient’s total score. 
Studies employing the scale, (Farina & Webb, 
1956; Phillips, 1953; Seidel, 1960) all report 
that patients who have achieved more ade- 
quate levels of premorbid adjustment tend to 
recover more rapidly from a schizophrenic 
episode than do those with a poor premorbid 
history. Laboratory studies indicating more 
adequate functioning under stress of good as 
compared to poor premorbid schizophrenic 
patients have been summarized recently 
(Garmezy & Rodnick, 1959; Rodnick & 
Garmezy, 1957). In summary, it would ap- 
pear that good premorbid adjustment is as- 
sociated both with marriage and remission 
for male patients and that greater ego ma- 
turity is an underlying factor. 
Unfortunately, prognostic studies have 
tended to concentrate on males and there have 
been relatively few studies of female schizo- 
phrenic patients. Orr, Anderson, Martin, and 
Philpot (1955), in studying the degree of as- 
sociation between a number of prognostic 
factors and recovery in a group of female pa- 
tients, report that, although marital status 
was associated with recovery, length of hos- 
pital stay and education were better predictors 
of remission. In another study of prognostic 
factors involving mainly female patients, 
Schofield, Hathaway, Hastings, and Bell 
(1954) failed to find a relationship between 
marital status and recovery. Thus the evi- 


willing to tolerate partially remitted patients at home 
than are conjugal families. 
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dence pertinent to determining the prognostic 
significance of marital status for schizophrenic 
females seems ambiguous and apparently no 
data on the significance of premorbid adjust- 
ment for recovery are available. 

This raises the question of whether marital 
status and level of premorbid adjustment also 
are associated with recovery from schizo- 
phrenia in the case of females. Here the pre- 
diction is less clear than with males. A con- 
stellation of characteristics such as extrover- 
sion, a capacity to establish and maintain 
friendships, and the ability to interact 
smoothly with others—all of which are aspects 
of adequate premorbid functioning—would 
appear to be prognostically favorable for both 
men and women. Such characteristics would 
seem to be inconsistent with the development 
of a chronic schizophrenic syndrome. For 
women, however, marital status may be less 
strongly associated with recovery from the 
disorder, since marriage need not imply the 
same degree of maturity as it does for males. 
In our society the woman’s role in the mar- 
riage process can be a very passive one and, 
indeed, if she has a detached, nonargumenta- 
tive disposition, she may appear more femi- 
nine and more attractive to some males. The 
implication of this position is that for schizo- 
phrenic females social aspects of premorbid 
adjustment should be more strongly asso- 
ciated with remission than marital status. 

This study was undertaken in order to 
determine the prognostic significance of both 
marital status and level of premorbid ad- 
justment for schizophrenic females. It was 
hypothesized that adequacy of premorbid 
adjustment would be more strongly associated 
with recovery for this type of patient than 
would marital status. 


PROCEDURE 


The data for the study were obtained from rec- 
ords available at two North Carolina state hos- 
pitals® The hospitals employ similar treatment 
procedures, each receives its patient, population from 
neighboring counties, and the patients housed at 
each hospital seem to be of comparable ages and 
social classes. All first admission female patients 
admitted to these hospitals between July 1, 1949 

¢ Appreciation is expressed to the administrative 
and professional staffs of Dorothea Dix Hospital, 
Raleigh, North Carolina, and John Umstead Hos- 
pital, Butner, North Carolina, for their cooperation 
in this study. 
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and December 31, 1952 were included in this study 
if they were between the ages of 17 and 40 and 
bore a staff diagnosis of schizophrenia uncomplicated 
by mental deficiency or organic brain damage. Each 
patient was traced for a minimum of 5% years 
after hospitalization, and the time spent in and 
out of the hospital was cumulated. In order to 
maximize the possibility of observing relationships 
between patient characteristics and recovery, a 
rigorous criterion for recovery was used and two 
extreme groups were selected. All patients who 
remained in the hospital no more than 6 months 
and who were out of the hospital at least 5 years 
were included in the Recovered Group. The Non- 
recovered Group consisted of those patients who 
remained in the hospital 5 years or more and who 
were out of the hospital for a total period of less 
than 6 months. All patients whose hospital his- 
tories placed them between these two groups were 
excluded from the study. 

After the two groups of subjects had been 
identified, the following data were tabulated for 
each subject from records obtained at the time of 
her admission: marital status, age, education, work 
history, and number and sex of younger and older 
siblings. All subjects were also rated for adequacy 
of social and sexual premorbid adjustment by 
means of the Phillips Premorbid Scale which was 
slightly modified to facilitate its use with female 
patients. The scale is composed of five subscales on 
each of which the patient is assigned a score rang- 
ing from 0 to 6. Subscale A is designed to measure 
adequacy of recent sexual adjustment; B, social 
aspects of the patient’s sexual life during the 
adolescent period; and C, social aspects of recent 
sexual life. Subscales D and E are intended to 
quantify nonsexual aspects of premorbid inter- 
personal functioning, the former being concerned 
with past and the latter with more recent inter- 


TABLE 1 


CHARACTERISTICS OF RECOVERED AND NONRECOVERED 
Groups OF SUBJECTS 


Recov- Nonre- 
Subjects ered covered 
All subjects N= 50 33 
Age at admission Mean 28.8 29.5 
SD 6.93 $.2 
Education in grades Mean 8.3 9.8 
completed 
SD 3.15 2.96 
Siblings Mean 5.5 5.8 
SD 2.44 2.87 
Per cent working 
prior to admission 70% 87% 
Married, separated, and 
divorced subjects N= 42 22 
Age at marriage Mean 19.3 21.7 
SD 2.83 4.16 
Number of children Mean 2.6 1.9 
SD 2.07 1.70 
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TABLE 2 
Marita Status oF SUBJECTS AT ADMISSION 


Recov- 
ered 


Nonre- 


Subjects covered 


Married Number 39 16 
Percentage 78 49 
Number 8 11 
Percentage 16 33 
Number 3 6 
Percentage 6 18 


Single 


Separated or 
divorced 


personal relationships. An individual’s total score, 
therefore, can vary from 0, indicating excellent pre- 
morbid adjustment, to 30 for dismally poor ad- 
justment prior to hospitalization. 

In order to avoid bias in rating, a clinical psy- 
chologist experienced in the use of the scale rated 
each patient without knowledge of her hospital his- 
tory. This was accomplished by providing the judge 
with only the social history portion of each pa- 
tient’s record. Since the social history had been 
obtained in each case at the time of admission to 
the hospital it contained no biasing information 
about subsequent outcome of the illness. To de- 
termine the reliability of the scale, a subgroup of 
22 subjects, representing a random sample drawn 
from each group, was independently rerated by a 
second judge. The resulting Spearman rho for the 
total scale was .86, indicating a reasonably satis- 
factory degree of reliability. 

For the married, divorced, or separated subjects 
of both groups, age at marriage and number of 
children were also obtained in addition to data 
tabulated for all subjects as described above. 


RESULTS 


Table 1 compares the Recovered and Non- 
recovered Groups on several variables un- 
related to the major purposes of this study. 
It is apparent that these groups are quite 
similar with regard to age at admission and 
number of siblings. There is a trend for the 
Nonrecovered Group to have had more educa- 
tion than the Recovered Group. This dif- 
ference, while not statistically significant, is 
consistent with previous findings for both 
male (Farina & Webb, 1956) and female pa- 
tients (Orr, et al., 1955). The married, 
separated, and divorced subjects of each group 
are also compared on age at marriage and 
number of children in Table 1. 

The findings concerning the marital status 
of the subjects in each group at the time of 
their first admission are presented in Table 2. 
It is readily apparent from this table that a 
greater percentage of Recovered subjects are 


married at admission than Nonrecovered sub- 
jects. The significance of this difference in 
proportion of married to nonmarried (single, 
separated, and divorced) patients was evalu- 
ated by means of a chi square test. The 
resulting chi square is 6.48 which, for one 
degree of freedom, is significant at better than 
the .05 level and suggests that married fe- 
males are more likely to recover from a 
schizophrenic process than are single, sepa- 
rated, or divorced females.” 

The analvsis of scores derived from the 
Phillips Premorbid Scale revealed that the 
Nonrecovered Group has a mean scale score 
about three points higher than the Recovered 
Group. The statistical significance of this 
difference in scores was determined by means 
of a Mann-Whitney U test (Mann & Whitney, 
1947) and the resulting z was 2.36, which is 
significant at the .05 level. It may be con- 
cluded, therefore, that relatively adequate 
premorbid adjustment is associated with com- 
paratively rapid and enduring recovery from 
a schizophrenic episode for this sample of 
patients. 

Thus the results indicate that both marital 
status and adequacy of premorbid adijust- 


7 When the samples from each of the two hos- 
pitals providing subjects are separately analyzed, the 
Recovered and Nonrecovered Groups from one 
hospital are not significantly different with regard 
to the proportion of married to nonmarried patients. 
However, the trends are in the direction shown in 
Table 2. 


TABLE 3 


RELATIVE DEGREE OF ASSOCIATION OF 13 
VARIABLES WITH RECOVERY 


Percent of 
additional 
variance 


7 accounted 
Variable 


Subscale C of Phillips scale 
Education 

Subscale D of Phillips scale 
Worked prior to admission 
Siblings—older male 
Subscale E of Phillips scale 
Age at admission 
Siblings—younger male 
Siblings—older female 
Siblings—younger female 
Marital status 

Subscale A of Phillips scale 
Subscale B of Phillips scale 
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ment are prognostically significant variables. 
However, since a correlation of .74 was found 
between marital status and total premorbid 
rating, the prognostic significance of premor- 
bid social adjustment was evaluated holding 
marital status constant. This was accom- 
plished by determining, for a homogeneous 
group of married patients, the relationship 
of Subscales D plus E to recovery. These two 
groups of items tap social functioning and, 
like the other subscales, good adjustment is 
indicated by low scores. The proportion of 
Recovered patients whose D plus E score 
fell below the median for both groups was 
71% while the corresponding proportion of 
Nonrecovered patients was 25%. This dif- 
ference is significant at the .05 level (,? = 
5.63) and can be interpreted as indicating 
that even in a group of female patients ho- 
mogeneous with regard to marital status, the 
more adequate the level and extent of inter- 
personal relationships manifested prior to the 
disorder, the greater the likelihood of re- 
mission. 

A further attempt was made to determine 
the relative degree of association of marital 
status and premorbid functioning with re- 
covery. A regression analysis * was performed 
using 13 variables, including marital status 
and the five Phillips subscales. In this anal- 
ysis, the most important variable in terms of 
proportion of variance accounted for in the 
dependent measure was selected out. The 
process was then repeated for the remaining 
12 variables and continued until only one 
variable remained. The rank order of these 
variables as determined in this way, as well 
as the additional proportion of variance ac- 
counted for by each is presented in Table 3. 
For these samples of subjects, Subscale C 
of the Phillips scale is most strongly associ- 
ated with recovery. While the score obtained 
on this subscale is in part a function of mar- 
ital status, adequacy of nonsexual interper- 
sonal relationships also determines the score. 
Furthermore, Items D and E, which are con- 
cerned exclusively with social relationships, 
show a relative association with recovery. In 
contrast to this, marital status clearly con- 
tains little or no unique information which 

8 Appreciation is expressed to C. A. Boneau for his 


assistance with this analysis which entailed the use 
of an IBM 650 computer. 


would aid in making a prognostic stc.ement. 
In spite of its significant association with re- 
covery, these results indicate that when the 
other variables preceding it are known, 
marital status adds virtually nothing to such 
a prediction. It is ranked as eleventh most 
important of the 13 variables, being followed 
only by Subscales A and B of the Phillips 
scale, and these three variables jointly ac- 
count for less than 1% of the variance in the 
dependent measure. The close rankings of 
these three variables is understandable since 
Subscales A and B measure presence and sta- 
bility of heterosexual relationships and such 
relationships, especially for women, would 
seem to vary with marital status. 


DISCUSSION 


The reported relationship between premor- 
bid adjustment and recovery seems to indicate 
that it is a particular type of patient who is 
likely to develop a form of chronic schizo- 
phrenia characteristic of a large segment of 
our state hospital populations. As with male 
patients, schizophrenic women who have had 
difficulty establishing satisfactory interpersonal 
relationships and who remained withdrawn 
and aloof from others prior to the disorder 
tend not to recover in comparison to women 
whose premorbid interpersonal relationships 
were more adequate. 

The obtained relationship between marital 
status and recovery may indicate that it is 
the better adjusted and more mature women 
who marry and who, because of their greater 
ego resources, recover more rapidly and for 
longer periods of time than do single women. 
However, many married subjects had shown 
very poor social adjustment prior to the onset 
of schizophrenia. Presumably the relationship 
between marriage and recovery could be made 
much more comprehensible by an analysis of 
such factors as the patient’s motivation for 
marriage, the personality of the husband, 
and the stability of the marriage itself. For 
example, a woman who marries because of 
strong dependency needs, to seek refuge from 
an unhappy family life, or because of passive 
compliance with society’s pressures for con- 
formity may be a poorer candidate for re- 
covery than a single woman unmotivated by 
such immature considerations. In contrast to 
these motivations, a woman who marries be- 
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cause of love and a desire for the responsibil- 
ities and pleasures entailed in having a family 
of her own may be expected to be a better 
prognostic risk if she develops schizophrenia. 
More intensive research into the marriage 
process itself and the motivations underlying 
marriage and the choice of a spouse will be 
necessary to determine the validity of these 
hypotheses. 

The results of the study offer some tenta- 
tive support for the hypothesis that, for fe- 
male schizophrenic patients, marriage is less 
strongly associated with remission than is 
adequacy of social premorbid adjustment. 
Whether comparable findings would charac- 
terize male schizophrenic patients or whether 
there would be a stronger relationship be- 
tween marriage and other aspects of social 
functioning can only be answered by further 
research. A study comparable to this one, 
using male patients and the same criteria for 
recovery and nonrecovery, is now underway. 

Finally, some methodological implications 
may be derived from this and other .studies 
which have been cited. In studies of prognosis 
in schizophrenia, it would appear to be im- 
portant to separate patients by sex before 
analyzing remission data in view of the dif- 
ferent meaning which such factors as marriage 
may have for males as compared to females. 
Failure to make such separations may either 
obscure existent relationships or result in 
inadequate or inaccurate generalizations. Fur- 
thermore, in studying the relationships be- 
tween premorbid functioning and recovery, 
married and single patients should be con- 
sidered separately because of the high positive 
correlation which exists between marital 
status and other aspects of premorbid ad- 
justment. 


SUMMARY 


A cohort of first admission female schizo- 
phrenic patients was identified and two groups 
were selected for study. One group had dem- 
onstrated a relatively rapid and enduring re- 
covery in terms of release from a mental 
hospital while the other group had failed to 
make such a recovery. Married patients and 
patients whose social interpersonal relation- 
ships prior to the disorder were reasonably 
good tended to make a relatively rapid re- 
covery from schizophrenia. Tentative evidence 


was obtained indicating that adequate inter- 
personal adjustment was prognostically more 
significant than marital status for this group 
of female patients. Some methodological im- 
plications of this and similar studies are 
discussed. 
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DEPENDENCY FANTASY AND GROUP CONFORMITY 
IN ULCER PATIENTS’ 


PETER WEISS ? ann WALTER EMMERICH 


University of Colorado 


Alexander and his colleagues have hy- 
pothesized that dependency frustration is a 
critical factor in the etiology of peptic ulcer 
(Alexander, 1934, 1950; Alexander & French, 
1948). According to this view, the close as- 
sociation during infancy between the feeding 
process and receipt of maternal affection re- 
sults in their becoming equated symbolically. 
Continued frustration of interpersonal expres- 
sions of dependency later in life may lead to 
regressive conversion into the wish to be fed, 
resulting in chronic gastric stimulation, and, 
in some cases, ulceration. It has been observed 
that some ulcer patients (Primary Type) ex- 
press their dependency needs directly and 
excessively, whereas others (Reactive Type) 
repress these needs and overcompensate 
through exaggerated ambitiousness and _ in- 
dependence (Kapp, Rosenbaum, & Romano, 
1947; Marquis, Sinnett, & Winter, 1952; 
Scodel, 1953; Winter, 1955). The utility of 
this typology depends upon the actual pres- 
ence of each type within the population of 
ulcer patients and the relative absence of each 
type from a variety of nonulcer populations. 
The present study was concerned with one 
aspect of this problem, and explored the 
hypothesis that Primary Type ulcer patients 
exhibit more dependency in both their fan- 
tasied and actual social relationships than 
nonulcer psychosomatic and nonpsychoso- 
matic controls. 

It seemed reasonable to assume that by 
selecting ulcer patients from the lower socio- 
economic levels, most of the sample would 
consist of the Primary, and not the Reactive 
Type. Obviously, such persons have had 

1This paper is based upon the senior author’s 
thesis in partial fulfillment of the requirements for 
the MA degree, University of Colorado. The authors 
wish to express their appreciation to the staff of the 
Veterans Administration Hospital, Denver, Colorado, 
whose cooperation made this study possible. Special 
thanks go to Abby Adams for her assistance in 
collecting the data. 

2 Now at Olive View Sanitorium, California. 


limited opportunities to prepare for or enter 
these occupations in which strong mobility 
strivings are highly sanctioned. It is also 
believed that their family backgrounds in- 
directly or even actively discourage the inter- 
nalization of status-seeking and achievement 
motives (Scodel, 1953). Further, there is some 
direct evidence that lower socioeconomic ulcer 
patients tend to have Primary Type person 
ality characteristics (Scodel, 1953; Winter, 
1955). 

n Succorance on the TAT (Murray, 1943) 
was chosen as the fantasy measure, as it 
clearly reflects the kinds of fantasies that 
have been attributed to the Primary Type 
ulcer patient. Actual interpersonal dependency 
was assessed in the group conformity situation 
developed by Asch (1952). The amount of 
conformity elicited in this situation presum- 
ably reflects the strength of the individual’s 
need to depend upon the judgments of others. 
Evidence that n Succorance on the TAT and 
Asch’s conformity measure refer to the same 
construct of dependency is provided by Kagan 
and Mussen’s (1956) finding of a positive 
relationship between these two measures in 
male undergraduates. 


METHOD 
Subjects 


The subjects were 38 male patients from the 
Veterans Administration Hospital, Denver, Colorado, 
74% of whom had been hospitalized for 15 days or 
less. Their ages ranged from 25 years to 65 years, 
with a median of 46 years. Thirty-three were mar- 
ried; four were divorced, separated, or widowed; 
and one was single. Hospitalization in a Veterans 
Administration hospital was taken as presumptive 
evidence of lower-middle or lower-class member- 
ship. A crude check of this assumption was provided 
by occupational information secured from hospital 
admission cards. None of the subjects reported 
having professional, semiprofessional, or entrepre- 
neurial positions, and 71% of the sample indicated 
that they were either semiskilled or skilled laborers. 

The three subgroups were formed as follows: (a) 
The ulcer (experimental) group was selected from 
the available gastrointestinal patients who were 
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diagnosed as having peptic or duodenal ulcers un- 
complicated by additional psychosomatic disorders. 
(b) The nonulcer psychosomatic control group were 
patients from the general medicine and surgery 
services whose diagnoses indicated that they had 
disorders of psychosomatic origin, such as asthma, 
rheumatoid arthritis, and neurodermatitis. (c) The 
nonpsychosomatic control group consisted of pa- 
tients having disorders believed not to be psy- 
chosomatic, such as cancer, genito-urinary infection, 
and joint dislocation. The three groups were matched 
roughly for age, marital status, occupation, and 
length of current hospitalization. Since some sub- 
jects were not available for testing on both the TAT 
and conformity measures, the subgroup Ns differed 
for each of these measures (see Table 1). 


Assessment of Dependency Fantasy 


The TAT administration procedure was similar to 
that developed by McClelland, Atkinson, Clark, and 
Lowell (1953). Groups of 5 to 10 patients responded 
on a standardized form to the TAT cards projected 
on a screen. The TAT cards used by Kagan and 
Mussen (1956) to measure dependency were pre- 
sented in the following order: 1, 3BM, 6BM, 7BM, 
12M, 13B, 14, 18BM. The experimenter read the 
following instructions to the group: 


The Veterans Administration Hospital is in- 
terested in your reactions to the pictures which 
we are going to show. It is part of a big research 
program, designed to modernize the patient pro- 
gram here. Please do not put your name on these 
papers. 

A number of pictures will be projected on the 
screen before you. You will have 20 seconds to 
look at the picture and then 5 minutes to make up 
a story about it. Notice that there is one page for 
each picture. The same four questions are asked 
about each picture. They will guide you in your 
thinking and enable you to cover all of the ele- 
ments of a plot in the time allotted. I will keep 
time and tell you when to go on to the next 
question for each story. 

It is important to remember that there is no 
right or wrong way to do this. We are interested 
in your reactions, so feel free to make up any 
kind of story you choose. Do not merely de- 
scribe what you see in the picture, but rather use 
your imagination and tell a good story. Make 
the pictures come alive in your stories. Work as 
fast as you can in order to finish in time. If you 
need more space for any question, use the reverse 
side of the page. The staff joins me in thanking 
you for your interest in this survey. Are there any 
questions ? 


The standardized form consisted of the following 
sequence of questions, taken from McClelland et al. 
(1953, p. 98): (a) What is happening? Who are the 
persons? (b) What has led up to this situation? 
That is, what has happened in the past? (c) What 
is wanted? By whom? (d) What will happen? What 
will be done? 
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Using Kagan and Mussen’s (1956) criteria, a 
dependency score was recorded for themes in which 
the hero seeks help from another person in solving 
a personal problem, or is disturbed over the loss of 
a source of love or support. The dependency fantasy 
score was the total number of cards containing a 
dependency theme, and could vary from zero to eight. 
In order to avoid halo effects, the protocols for each 
card were rated across subjects. Stories were rated 
without knowledge of the subjects’ diagnoses or 
conformity scores. There was perfect agreement be- 
tween two judges’ independent ratings of the 40 
protocols of 5 randomly chosen subjects. 

Administration of the TAT always preceded the 
conformity test, and a patient received both tests on 
the same day. In order to minimize the possibility 
that ‘subjects would connect the two parts of the 
study, different experimenters and experimental rooms 
were used for each part. 


Assessment of Conformity 


Seven confederates participated in the Asch (1952) 
group conformity situation. When subject entered 
the experimental room, the confederates had occupied 
chairs arranged in two rows of four facing experi- 
menter. The remaining chair was placed so that sub- 
ject would be the seventh person to announce his 
judgment. Experimenter read the following instruc- 
tions to the group: 


This is a task which involves the discrimination 
of the lengths of lines. You see a pair of white 
cards in front. The card on the left shows a 
single line; the card on the right has three lines 
differing in length. They are numbered 1, 2, and 3 
in order. One of the three lines on the card on the 
right is equal in length to the standard line on 
the card on the left. You will decide in each 
case which is the equal line. You will state your 
judgment in terms of the corresponding number; 
either one, two, or three. There will be twelve 
such comparisons. Since the number of lines is few 
and the group is small, I shall call upon each of 
you in turn to announce your judgment which I 
shall record here on a prepared form. Please be as 
accurate as possible. Suppose we start at the right 
and proceed to the left and back again to the 
right in the second row. 


The 12 pairs of cards were identical to those 
used by Asch (1952). The seven confederates were 
given prior instructions to respond correctly on 
Trials 1, 2, 5, and 11, and to give the same incorrect 
response on the remaining seven trials. The con- 
formity score was the total number of yields on the 
seven critical trials. Upon completion of the con- 
formity test, subject was informed of its nature, 
and was invited to participate as a confederate in 
the test for the next subject. The other confederates 
were recruited from the neuropsychiatric service. 
Confederates and subjects were never more than 
casually acquainted with each other prior to the 
conformity test. 
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DEPENDENCY AND CONFORMITY IN ULCER PATIENTS 


TABLE 1 


Group MEDIANS AND COMPARISONS ON THE DEPENDENCY FANTASY AND 
Group CONFORMITY MEASURES 


Group 
Ulcer (U) 
Nonulcer psychosomatic (P) 
Nonpsychosomatic (N) 


Group medians 


Group comparisons 


UXN 
I N 


* Rounded to the nearest multiple of .5. 


Dependency 
fantasy 


Group 
conformity 


Median* 
5.0 
4.0 


Median* 


> » values are for one-tailed Mann-Whitney U tests corrected for tied ranks. 


RESULTS AND DISCUSSION 


Both the ulcer and nonulcer psychosomatic 
groups wrote significantly more dependency 
themes on the TAT than did the nonpsy- 
chosomatic controls (see Table 1). However, 
although the ulcer group was more dependent 
than the nonulcer psychosomatics on this 
measure, the difference was not significant. 
Thus, the hypothesis with respect to depend- 
ency fantasy was only partially confirmed. 

The hypothesis was more consistently sup- 
ported on the Asch conformity measure. The 
ulcer group yielded significantly more fre- 
quently than both the nonulcer psychoso- 
matic and nonpsychosomatic controls, and 
the latter two groups did not differ signifi- 
cantly from each other (see Table 1). 

The results confirm the belief that Primary 
Type ulcer patients are more dependent in 
both fantasy and actual social behavior than 
nonpsychosomatics. However, since both the 
ulcer and nonulcer psychosomatic groups 
were differentiated on the fantasy measure 
from the nonpsychosomatics, but not from 
each other, it would appear that strong de- 
pendency needs are not specific to ulcer 
among the psychosomatic disorders. This con- 
clusion is supported by the clinical observa- 
tion that unresolved dependency occurs in a 
number of psychosomatic disorders (Alex- 
ander, 1950; Alexander & French, 1948), and 
by Streitfield’s (1954) finding of no difference 
between ulcer and nonulcer psychosomatic 
patients on Rorschach and Blacky measures 
of oral dependency. 


The fact that group conformity was greater 
in the ulcer than in the nonulcer psychoso- 
matic group is consistent with the findings of 
Scodel (1953) and Marshall (1960), and 
indicates that specificity lies in the way that 
dependency is mediated in actual behavior, 
rather than in need strength per se. Whereas 
the ulcer group tended to express their de- 
pendency needs directly, the nonulcer psy- 
chosomatics inhibited these needs in the con- 
formity situation. Such inhibition, together 
with the relatively strong dependency wishes 
revealed by the TAT, suggests that the non- 
ulcer psychosomatic patient has considerable 
anxiety about his dependency needs. It is 
possible to explore this hypothesis by com- 
paring the groups on TAT dependency themes 
rated for dependency-anxiety. A dependency 
theme was scored as anxious if its ending 
indicated nonfulfillment of dependency be- 
cause of object loss, rejection, or punishment 
by the object of nurturance. This criterion 
was met by 34% of the themes in the ulcer 
group, 50% in the nonulcer psychosomatic 
group, and 26% in the nonpsychosomatic 
group.* These findings are consistent with the 
interpretation, but further research is neces- 
sary to provide a more definitive test and to 
determine if dependency-anxiety in nonulcer 
psychosomatics is accompanied by overcom- 
pensation and/or other specific defenses. 


3 Significance tests of group differences on this 
measure were not feasible because of the large per- 
centage of subjects who received zero scores in con- 
junction with small subgroup Ns. 
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In the nonpsychosomatic group, the TAT 
fantasy and Asch conformity measures were 
positively and significantly correlated (rs = 
+ .67, p< .05). This finding cross-validates 
the results of Kagan and Mussen (1956), and 
supports the belief that the two measures 
refer to the same dependency construct. How- 
ever, these measures were not significantly 
correlated in the ulcer, nonulcer psychoso- 
matic, or combined psychosomatic groups. 
The present study allows only for speculation 
as to why these two measures did not cor- 
relate in the psychosomatic groups. Perhaps 
the relatively high dependency wishes of psy- 
chosomatics in connection with the structure 
of the conformity situation posed a dilemma 
which they did not resolve consistently from 
trial to trial. In the nonulcer psychosomatic 
group, internalized conflict arising from 
dependency-anxiety may have resulted in 
alternating expressions and inhibitions of the 
yielding tendency, thereby attenuating the 
correlation. In the ulcer group, the external 
conflict between trying to please the group by 
yielding to consensual error and alienating the 
experimenter by not being as “accurate as 
possible” could have had the same effect. 
Further study using more refined measures 
of conformity is needed to explore these 
possibilities. 


SUMMARY 


This study investigated the hypothesis that 
Primary Type ulcer patients exhibit strong 
dependency in both their fantasied and actual 
social relationships. Subjects were 38 lower- 
middle- and lower-class male Veterans Ad- 
ministration Hospital patients, subdivided 
into ulcer, nonulcer psychosomatic, and non- 
psychosomatic groups. Dependency fantasy 
was measured by the number of dependency 
themes elicited on selected TAT cards, and 
actual dependency by the number of yields 
in the Asch group conformity situation. The 
three groups were compared on both measures. 

As expected, the ulcer patients wrote more 
dependency themes and exhibited greater group 
conformity than the nonpsychosomatics. The 
ulcer group was also more conforming than 
the nonulcer psychosomatics. However, ulcer 
patients were not significantly more dependent 
in fantasy than the nonulcer psychosomatics, 


although the latter group exhibited more de- 
pendency fantasy than the nonpsychoso- 
matics. 

The pattern of findings suggested that 
strong dependency needs exist generally in 
psychosomatic patients, but whereas these 
needs are often expressed directly by the 
Primary Type ulcer patient, they tend to be 
inhibited by nonulcer psychosomatics. Sup- 
plementary data on differential dependency- 
anxiety among the three groups provided 
tentative support for this interpretation. 

The TAT dependency fantasy and Asch 
conformity measures were correlated in the 
nonpsychosomatic group, but were not sig- 
nificantly related in the two psychosomatic 
groups. It was suggested that different forms 
of dependency conflict in the two psychoso- 
matic groups may have occurred in the con- 
formity situation, thereby attenuating the 
correlation. 
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UNREALISTIC SELF-ENHANCEMENT IN 
PARANOID SCHIZOPHRENICS 


PHILIP H. HAVENER®? anp CARROLL E. IZARD 


Vanderbilt University 


In one of the early studies dealing with 
self-perception, Frenkel-Brunswik (1939) ex- 
amined the relationship between self-reports 
and sociometric ratings and observed the 
tendency of some subjects to utilize a mecha- 
nism of defense which she termed distortion 
into the opposite; e.g., perceiving one’s self 
as being sincere while peers perceived lack 
of sincerity. In another early study in this 
area, Sears (1936) described the related de- 
fense mechanism of projection as the tend- 
ency of subjects to attribute to others cer- 
tain negative characteristics which they them- 
selves possessed, according to the judgment 
of their peers. These defense mechanisms of 
“distortion into the opposite” and “projec- 
tion” are herein interpreted as over-valuation 
of self and devaluation of others, defining as- 
pects of the more inclusive defense process of 
unrealistic self-enhancement. 

The self-theory of Rogers (1947, 1950) 
and Combs and Snygg (1959) leads to the 
general hypothesis that self-satisfaction and 
self-acceptance are positively related to ad- 
justment and acceptance of others. Several 
studies (Rogers & Dymond, 1954) confirmed 
this hypothesis. However, some researchers 
failed to find a linear relationship between 
self-acceptance or self-satisfaction and ad- 
justment (Bennett, 1957; Friedman, 1955). 
These studies suggested that some types of 
maladjusted persons, notably paranoid schiz- 
ophrenics, made unrealistc self-appraisals 
and were not dissatisfied with themselves to a 
degree commensurate with the degree of their 
disturbance. These findings pointed to the 
tenabilty and explanatory value of the con- 
struct of unrealistic self-enhancement. 

An underlying assumption of the present 


1This paper is a condensation of part of the 
senior author’s doctoral dissertation, completed at 
Vanderbilt University under the junior author’s 
supervision. 

2Now at Orange County Community College, 
Middletown, New York. 


study is that unrealistic self-enhancement re- 
sults from the individual’s failure to develop 
or maintain adequate self-related positive 
affect and satisfying interpersonal affective 
ties. Self-related positive affect—positive feel- 
ings toward self, self-respect, self-esteem, 
self-confidence—is seen as a basic need in 
healthy personality development. As an indi- 
vidual receives interpersonal positive affect— 
positive feelings, respect, esteem—from 
others, his self-related positive affect increases 
along with his own tendency to express inter- 
personal positive affect. 

It was thought that subjects making ex- 
tensive use of paranoid defenses would dem- 
onstrate a marked tendency to utilize unreal- 
istic self-enhancement—overvaluation of self 
with concomitant devaluation of others. In 
the more operational terms of this study, the 
general hypothesis was that paranoid schiz- 
ophrenics will show more distortion in the 
perception of self and others as measured by 
objective inventories than will persons not 
using paranoid defenses. The specific hy- 
potheses were: 


H,: Paranoid schizophrenic subjects will 
show a greater difference between expressed 
acceptance of self and expressed acceptance 
of others than will normal subjects and non- 
paranoid schizophrenic subjects. 

He: Paranoid schizophrenic subjects will 
accept fewer mildly self-derogatory state- 
ments about themselves than will normal sub- 
jects and nonparanoid schizophrenic subjects. 

H;: Paranoid schizophrenic subjects will 
express a greater amount of self-satisfaction 
than will normal subjects and nonparanoid 
schizophrenic subjects. 


METHOD 


A group of 20 hospitalized paranoid schizophrenics 
was selected on the basis of a diagnosis as extreme 
and chronic in their paranoid behavior and on the 
basis of a high score on the C (Paranoid Projec- 
tion), H (Submissiveness vs. Belligerence), and J 
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(Self-depreciation vs. Grandiose Exclusiveness) fac- 
tors of the Scale for Rating Psychiatric Patients 
(Lorr, 1953). A second group of 20 hospitalized 
schizophrenics was chosen on the basis of a diag- 
nosis other than paranoid and a low score on the 
C, H, and J factors of the same scale. A group of 
20 attendants at the same hospital formed the second 
control group. All three groups were equivalent in 
age, school grade completed, intelligence, and sex. 

The Tennessee Department of Mental Health 
(TDMH) Self-Concept Scale was used to measure 
distortions in the perception of self (Fitts, 1956, 
1959). It consists of 100 statements descriptive of 
self in five different areas of self-concern, 45 state- 
ments being positive, and 45 being negative. Ten 
additional statements, taken from the MMPI Lie 
scale, are included to measure a person’s honesty in 
responding. Distortion in the perception of self was 
measured by two scores from the Self-Concept Scale. 
One was self-enhancing defensiveness as operation- 
ally defined by the LZ score. The second was self- 
satisfaction as operationally defined by the difference 
between Row I (abstract description of self) and 
Row II (satisfaction with self). 

The Berger Scale (Berger, 1952), which contains 
36 items referring to self and 28 items referring to 
others, was used to measure distortion in perception 
of others. Operationally, distortion in the perception 
of others was defined as a large difference between 
acceptance of self (AS) and acceptance of others 
(AO), with the latter as the subtrahend. 


RESULTS 


In general, the paranoid schizophrenics 
demonstrated more distortion in the percep- 
tion of self and others than did the nonpara- 
noid schizophrenics or the normal group. The 
data indicated that the paranoid group has 
an inflated concept of the self and a low 
concept of others. 

To test H, the mean scores of AS—AO for 
the first three groups were compared by anal- 
ysis of variance and ¢ tests. The assumptions 
of normality of distribution and homogeneity 
of variance were tested by x? for goodness of 
fit and the F,.x test (Bliss & Calhoun, 1954), 
respectively. In both cases the null hypothesis 
could not be rejected. Using the formula for 
deriving the reliability of difference scores 
- (Nunnally, 1959), the reliability coefficient 
for AS-AO was .80. The analysis of variance 
of the AS-AO scores showed that the group 
of paranoid schizophrenic subjects had a sig- 
nificantly larger mean score than did the 
normals and nonparanoid schizophrenic sub- 
jects. The F was 7.23, with p less than .01. 
The ¢ comparing the mean AS-AO score in 


the paranoid and normal groups was 2.9, 
which is significant at the .005 level of con- 
fidence. The comparable ¢ for the paranoid 
and nonparanoid groups was 3.4, which is 
significant at. the .005 level. These analyses 
supported H,. 

The AS and AO scores of the three groups 
were compared separately by ¢ tests. The 
mean AS score for paranoids was higher than 
that of nonparanoid schizophrenics (¢ was 
1.97, p approached .05) and approximately 
equal to that of normals. Similarly, on the 
total net positive (TNP) self-concept score 
derived from the TDMH scale, paranoids 
had a mean score approximately equal to that 
of normals and substantially higher than that 
of nonparanoids (¢ was 1.82, p less than .08). 
By contrast, the nonparanoids had signifi- 
cantly lower mean scores for AS (p less than 
01) and TNP (> less than .05) than did 
normals. The mean AO score of the paranoid 
schizophrenics was significantly lower than 
that for normals (¢ was 5.46, p less than 
.005); it was lower than that for nonpara- 
noids but not significantly so. Nonparanoids 
also had a significantly lower mean AO score 
than normals (¢ was 4.45, p less than .005), 
but the mean difference was not as great as 
that for paranoids and normals. 

To test Hs the mean ZL scores for the three 
groups were compared by analysis of variance 
and ¢ tests. The assumptions of normality of 
distribution and homogeneity of variance 
were tested by ,? for goodness ot fit and Fimax 
test and found tenable. The analysis of vari- 
ance of the LZ scores showed that paranoid 
schizophrenics had a significantly smaller 
mean Z score than did the normal subjects 
but not significantly smaller than the non- 
paranoid schizophrenics. The F was 3.86, p 
less than .05. The ¢ between the means of 
the paranoid schizophrenics and the normals 
was 3.0, p less than .01. The ¢ between the 
means of the paranoid schizophrenics and the 
nonparanoid schizophrenics was 1.1, p greater 
than .05. The mean difference was in the 
predicted direction. These analyses supported 
part of Hp. 

To test H; the mean self-satisfaction scores 
(Row I—Row II on the TDMH Self-Concept 
Scale) for the three groups were compared 
by analysis of variance and ¢ tests. The as- 
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sumptions of normality of distribution and 
homogeneity of variance were tested by ,’ 
for goodness of fit and the Fy, and found 
tenable. The difference score reliability coef- 
ficient for Row I-Row II was .78. The anal- 
ysis of variance of these scores showed that 
the paranoid schizophrenics had a Row I-Row 
II score significantly smaller than did the 
normal subjects but not significantly smaller 
than the nonparanoid schizophrenics. The F 
was 3.81, p less than .05. The ¢ for the means 
of the paranoid schizophrenics and nonpara- 
noid schizophrenics was .86, p greater than 
.20. The mean difference was in the predicted 
direction. These analyses partly supported Hs. 

An additional finding was that the paranoid 
schizophrenic subjects had a more positive 
concept of their personal characteristics than 
of their family relationships, whereas the 
normal and nonparanoid schizophrenic sub- 
jects had a more positive concept of their 
family relationships than they did of their 
personal characteristics. This finding was 
obtained by comparing the three groups on 
the difference between two subscores of the 
Self-Concept Scale: Column C (which meas- 
ures personal characteristics) and Column D 
(which measures family relationships). The 
difference-score reliability for Column C—Col- 
umn D was .67. The data were analyzed by 
t test. The ¢ for the difference between the 
means of paranoid schizophrenics and non- 
paranoid schizophrenics was 2.33, p less than 
.05; t for paranoid schizophrenics and nor- 
mals was 2.76, p less than .05. This was a 
post hoc analysis, since these Self-Concept 
Scale subscores were not known to the writers 
at the time the research was planned. 


DISCUSSION 


The results showed that the discrepancy 
between acceptance of self and acceptance of 
others was significantly greater for paranoid 
schizophrenics than for normals and non- 
paranoids. The degree to which paranoids 
expressed positive self-perceptions (in terms 
of AS and TNP scores) was approximately 
equal to that of normals and substantially 
greater than that for nonparanoid schizo- 
phrenics. To what extent this points to verid- 
ical self-acceptance in the paranoid schizo- 
phrenic is not evident, but it is reasonable to 


assume that the numerically equal AS and 
TNP scores of paranoids and normals are not 
psychologically equivalent. While the para- 
noids’ mean scores on these measures of self- 
regard were inflated in comparison with those 
of nonparanoid schizophrenics, their mean 
scores on the measure of acceptance of others 
was somewhat lower than that of the non- 
paranoid schizophrenics. Izard (1959) found 
that the paranoid not only tends to express 
unfavorable feeling toward others but actually 
perceives others (photographs of normals) as 
hostile, suspicious, and threatening. These 
results are considered as evidence in support 
of the basic assumption of the present study 
that paranoid schizophrenics have inadequate 
interpersonal affective ties and an overbun- 
dance of affect directed toward self and 
defense of self. 

Speculatively, we might say that the para- 
noid schizophrenic’s self-acceptance and self- 
satisfaction are nonveridical and that, indeed, 
he cannot experience genuine self-acceptance 
because he is lacking in self-related positive 
affect—veridical positive feelings toward self, 
self-respect, and self-esteem. His deficiency 
of self-related positive affect developed as a 
function of insufficient interpersonal rewards, 
which in turn is related to his tendency to 
devalue other people and to perceive them 
as hostile and threatening. Thus, his un- 
realistic self-enhancement—overvaluing of self 
and devaluing of others—can be seen as a 
defense against complete loss of self-related 
positive affect and of interpersonal positive 
affect. 


SUMMARY 


The basic assumption underlying this study 
was that adequate self-related and interper- 
sonal positive affect are necessary for effec- 
tive adjustment. The general hypothesis was 
that paranoid schizophrenics would show 
greater distortion in the perception of self 
and others than nonparanoid schizophrenics 
or normals. The specific hypotheses are listed 
below: 


H,: Paranoid schizophrenic subjects will 
show a greater difference between expressed 
acceptance of self and expressed acceptance 
of others than will normal subjects and non- 
paranoid schizophrenic subjects. 
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Hz: Paranoid schizophrenic subjects will 
accept fewer mildly self-derogatory statements 
about themselves than will normal subjects 
and nonparanoid schizophrenic subjects. 

H;: Paranoid schizophrenic subjects will 
express a greater amount of self-satisfaction 
than will normal subjects and nonparanoid 
schizophrenic subjects. 


Twenty paranoid schizophrenics, 20 non- 
paranoid schizophrenics, and 20 normals were 
given the Berger Scale and the Tennessee 
Department of Mental Health Self-Concept 
Scale. These inventories furnished indices of 
acceptance of self (AS), acceptance of others 
(AO), self-satisfaction, and an L score. The 
three hypotheses were tested by comparing 
the three groups on AS—AO, self-satisfaction, 
and L scores by means of analysis of variance 
and ¢ tests. The results confirmed H, and 
part of Hz and Hg. An additional analysis 
of two subscores of the Self-Concept Scale 
further substantiated the general hypothesis 
by showing that paranoid schizophrenics had 
higher opinions of their personal character- 
istics and lower opinions of their family re- 
lationships than did normals and nonparanoid 
schizophrenics. 

The results showed that the paranoids tended 
to overrate self as compared with nonpara- 
noid schizophrenics. It was reasoned that this 
was evidence of unrealistic self-enhancement, 
a defense against complete loss of genuinely 
positive self-related affect and of satisfying 
interpersonal affective ties. 
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At the present time, there is relatively little 
systematic fully objective knowledge about 
the overt behavior of chronic schizophrenic 
individuals hospitalized in large neuropsychi- 
atric institutions. 

Existing techniques for the measurement of 
overt patient behavior are typically rating 
scales in which various conceptions of psy- 
chopathology have been dimensionalized. The 
most prominent and probably most widely 
used of these is Lorr’s Multidimensional Psy- 
chiatric Rating Scale (Lorr, Jenkins, & 
O’Connor, 1955). There are also measure- 
ment techniques developed for special pur- 
poses to record particular aspects of patient 
behavior (Laffal, Sarason, Ameen, & Stern, 
1957; Pace, 1957). Lacking are measuring 
instruments free of any assumptions about 
how psychopathology is organized in neuro- 
psychiatric patients and which are sufficiently 
comprehensive to yield a full, objective pic- 
ture of the patients’ adjustment in the hos- 
pital environment. In other words, what is 
needed to provide a firm basis for projecting 
hypotheses about the nature and structure of 
the behavior of neuropsychiatric patients are 
measuring instruments which tell us more 
about what the patient actually does or does 
not do, rather than instruments which tell us 
where his behavior places him on a continuum 
of psychopathology. 

The lack of such objective, comprehensive 
methods for assessing patient behavior be- 


1 Hunter carried out this study while at the 
Veterans Administration Hospital, Montrose, New 
York. 

2Schooler carried out this study while at the 
Veterans Administration Hospital, Montrose, New 
York. 


comes especially apparent when one confronts 
the task of measuring and evaluating the 
effects of a socio-environmental treatment 
program. This lack provided the impetus for 
the development of the observational in- 
strument, the Location-Activity-Inventory 
(LAI),® to be described here. This article is 
concerned with a description of the develop- 
ment of this instrument, its major relevant 
characteristics and use, its reliability and 
finally the kinds of patient “images” that 
analysis of the LAI data is capable of 
vielding. 

The LAI was designed to provide knowl- 
edge about the behavior of chronic schizo- 
phrenics as relevant to the environment in 
which they characteristically function—the 
ward of a large neuropsychiatric hospital. 
Specifically, the instrument was developed to 
measure the behavioral characteristics of a 
group of 100 male patients on two closed 
wards of a neuropsychiatric hospital and to 
study their responses to a projected socio- 
environmental treatment program. The major 
objective in its development was to obtain, 
through a time sampling method, a reliable 
and comprehensive description of the overt 
ward behavior of chronic schizophrenic 
patients. 

Extensive free hand observations had dis- 


8 Copies of the manual detailing the method of 
observation and the operational definitions of ob- 
servational categories have been deposited with the 
American Documentation Institute. Order Document 
No. 6940 from ADI Auxiliary Publication Project, 
Photoduplication Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 
for microfilm or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 
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closed four major dimensions of patient ward 
behavior as relevant: (@) geographical loca-' 
tion in the ward setting; (5) position in re- 
lation to the wall; (c) posture or body dis- 
position; and (d) activity or response to ward 
stimuli. 

These particular dimensions were at length 
chosen to represent patient behavior because 
they seemed to discriminate between patients 
and because they appeared to reflect the terms 
in which patients make their characteristic ad- 
justment, both pathological and nonpatholog- 
. ical to the ward environment. Furthermore, 
it was felt that they would yield behavioral 
profiles permitting systematic inferences re- 
garding the dynamics of behavior and 
pathology. 

The specific categories used to describe be- 
havior characterizing the four dimensions 
were selected on the basis of three criteria: 
first that the need for subjective judgment 
on the part of an observer be minimal; sec- 
ond, that the categories be exhaustively 
descriptive; and, third, that the categories 
be as specific as possible without ceasing to 
represent meaningful units of behavior. 

As regards the Location dimension the total 
ward space was subdivided into 14 functional 
geographical areas. The presence of a patient 
in one of these areas when observed thus 
constituted his location response. The Posi- 
tion dimension was characterized in terms of 
six response categories: “Against the Wall,” 
“Facing the Wall,” “Against Corner,” “Fac- 
ing Corner,” “Looking out Window,” and 
“None of the Above.” The Posture categories 
consisted of “Sitting,” “Standing,” “Lying,” 
“Walking,” “Sitting on the Floor,” and 
“Lying on Floor.” Finally, the categories 
descriptive of the Activity dimension; i.e., 
patient responses to the varied stimuli of the 
ward environment were divided into several 
broad classes. These are (a) social activities 
(verbal-social, gestural-social, laughing-social, 
physical contact, and social games); (5) 
parasocial activities (reading, writing, non- 
social games, TV viewing, verbal, laughing, 
and gestural parasocial); (c) functional, non- 
social activities (ward service, personal care, 
excretory, and functional object manipula- 
tion); (d) nonfunctional activities (self- 
manipulation, active movement, nonfunctional 
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object manipulation, verbal, gestural, and 
laughing nonsocial); (¢) no behavior (“null” 
behavior and sleep). 

While these categories may seem to be face- 
valid and obvious, precise definitions of each 
were required in order to obtain interobserver 
agreement. For example, a patient was judged 
to be “against the wall” if his body or the 
back of the chair in which he was seated was 
within 2 feet of the wall. The care that had 
to be exercised in this regard is perhaps even 
better illustrated by the definition of “tele- 
vision viewing”: “A subject will be judged as 
watching television in the presence of either 
a specific posture response or an observable 
empathic or reactive response to the television 
set. The posture response refers to a position 
of the body which requires some special strain 
to orient the head toward the television set.” 


PROCEDURE 


The LAI is used by a single observer. A trained 
observer requires about 15 minutes to record the 
behavior of 50 patients on a ward. He moves through 
the ward along a predetermined route. In each of 
the 14 areas of the ward, he notes the number and 
identity of the patients present as he enters. He 
then observes each patient for a maximum of 10 
seconds and records behavior using a single category 
in each of the four major dimensions. The cate- 
gories are regarded as mutually exclusive. When a 
patient is engaged in more than one activity, the 
observer makes a determination of which is the most 
salient. After recording the behavior of patients in 
a given area, the observer moves on to the next 
area and repeats the procedure until all areas have 
been covered. In general, the observation procedure 
is roughly analogous to taking a panoramic photo- 
graph of all the patients on the ward, with a 10- 
second exposure and using a film sensitized to pick 
up specified dimensions of behavior. 

Observers did not make data-yielding observations 
with the LAI until they had made practice obser- 
vations on the wards for some time and until their 
presence on the ward had become commonplace from 
the patient’s point of view. The time, the order in 
which observers moved through the geographical 
areas of the ward, and the observation posts adopted 
by the observers were varied for each occasion of 
observation to minimize the predictability of the 
observer’s appearance on the ward or his pattern 
of movements. 


RESULTS 


The data reported in this article represent 
6 months of observation by four observers on 
two wards. These observations were made 
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TABLE 1 


PERCENTAGES OF TOTAL FREQUENCIES OF OBSERVATIONS IN THE VARIOUS CATEGORIES 


Position Posture 


Activities 


Against wall 
Against corner 
Facing wall 

Facing corner 
Looking out window 
None of the above 
No recording 


Sitting 
Standing 
Lying 
Walking 
Sitting on floor 
Lying on floor 
No recording 


& & WwW 


Total 


during the base-line phase of an experimental 
ward project while the wards were function- 
ing under a characteristic custodial regimen. 
During this period each waking hour of the 
patient’s day (total = 12) and each day of 
the week were equally sampled. Observations 
were scheduled in such a way as to avoid 
systematic time of day or day of week bias. 
An average of 39 patients present on each 
ward was observed on 240 occasions yielding 
a total of approximately 9,000 patient oc- 
casions of observation or an approximate 
average of 90 observations for each of the 
100 patients observed. 

The reliability of the instrument was ex- 
amined from two points of view: degree of 
agreement among observers and _ split-half 
reliability of categories based on correlations 
between odd and even occasions of observa- 
tion. Observer reliability was established by 
randomly pairing observers and having them 
make joint but independent observations. 
Contingency coefficients for each pair were 
calculated for the position, posture, and 
activity categories. The contingency coeffi- 
cients for position averaged .84, for posture 
86, and .78 for the activity dimension. The 
split-half reliability coefficients calculated for 
individual categories averaged .91 across the 
categories. (These correlations are Pearson r’s 
corrected by the Spearman-Brown prophecy 
formula for the full length of the scale). 
These analyses indicate a reasonable degree of 
observer agreement in making behavioral 
judgments and a high degree of stability 
of patient behavior on a group basis. 

The behavioral profile characterizing the 
chronic schizophrenic patients observed with 


Null behavior 

Sleeping 

Social (Interaction) 
Parasocial 

Functional Nonsocial 
Nonfunctional (Pathological) 


9.2 
18.7 


Total 100.0 


the LAI is shown in Table 1. The percentages 
shown in the various categories represent the 
proportions of total frequencies cumulated by 
all patients in the position, posture, and 
activity dimension.* 

The most striking aspect of the above table 
is found in an examination of the activity 
categories, in which one finds that patients 
spend 44% of their time in null behavior, 
(total immobility) and 11% in sleeping. This 
means that during the waking day on the 
ward, patients spend more than half of their 
time in the absence of any kind of overt 
activity neither talking nor reading, nor even 
scratching themselves. In contrast to the large 


*It should be noted that the proportions shown 
in the table represent cumulated frequencies for the 
total group of patients and do not reflect the con- 
tributions of individuals to the total percentage or 
interindividual variability. 

The group of 100 male patients characterized by 
these data were all diagnosed as schizophrenic (29 
paranoid, 37 hebephrenic, 14 catatonic, 10 simple, 10 
undifferentiated or mixed). They had been hospital- 
ized for an average of 8.9 years and had an average 
age of 39.8 years. No patients with known organic 
brain damage were included. 


TABLE 2 


PERCENTAGES OF SOCIALIZERS AND NONSOCIALIZERS 
DIsPLAYING NULL, PARASOCIAL, AND PATHOLOGICAL 
BEHAVIOR 


Behavioral 
category 


Null 

Parasocial 

Pathological 
(nonfunctional) 


49.52 
12.98 


14.72 19.99 


* Significant at <.05 level of confidence. 
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amount of time doing nothing, the patients 
spend only 2.7% of their time in social 
activities. Furthermore, this figure must be 
interpreted with care, for the median fre- 
quency for all social behavior is zero. 

While not all possible interrelationships 
among categories and between categories and 
other independent measures of behavior have 
been examined, the following are presented to 
exemplify the psychologically consistent pic- 
ture of patient behavior which emerges upon 
internal examination and the rational relation- 
ships of behavior as observed on the LAI 
with other levels of behavioral measurement. 
Thus for example, patients who are char- 
acterized by a relatively high rate of social 
interaction differ from nonsocializers in other 
characteristics as well. 

As can be seen from Table 2, those who 
were observed in at least one social interaction 
differ significantly from those who were never 
observed in such an interaction in several 
ways.® Those who socialized exhibited more 
parasocial and less null behavior. Further- 
more, there is some evidence that socializers 
also exhibited less nonfunctional pathological 
appearing behavior. Although the difference 
between the two groups in this latter behavior 
shown in the table did not achieve signifi- 
cance, a further analysis in which the group 
of socializers were divided by a median break, 
revealed that the high socializers were sig- 
nificantly different from those who did not 
socialize. (High socializers, mean 12.88; non- 
socializers, mean 19.99, f = 1.98, p < .05). 

As regards relationships between measure- 
ments of overt behavior via the LAI and other 
levels of behavioral measurement—such as 
“intrapsychic” factors, a comparison with 
Rorschach data was made using the Becker 
(1956) Developmental Level Score (DL). 
This score is based on a one-response-per- 
card Rorschach in which a high score is in- 
terpreted as representing a high level of per- 
ceptual organization and by inference a high 
level of personality integration. The N here 
is 75. A number of patients proved untestable. 
Significant correlations (p< .05) were 


5 The entries in this table are the average per- 
centages based on individual percentages in the 
several groups. 


found between DL score and social behavior 
(.273); between DL and parasocial behavior 
(.340); between DL and nonfunctional be- 
havior (— .313); between DL and null be- 
havior (— .292). These correlations seem to 
confirm the expectation that the segments of 
behavior measured by the LAI are not unique 
and isolated phenomena but bear significant 
relationships to other behavioral modalities. 

One specialized analysis undertaken with 
LAI was designed to test the popular belief 
that the phases of the moon affect the be- 
havior of the mentally ill patients. While this 
may appear frivolous, it should be pointed 
out that the lunacy hypothesis is widely be- 
lieved by personnel and that systematic 
changes in patient behavior related to phases 
of the moon might reflect personnel expecta- 
tions. For purposes of this analysis observa- 
tional data were divided into four quarters 
corresponding in time to occasions of the full 
moon, the last quarter, the new moon, and 
the first quarter. 

A comparison of relevant indices such as 
nonsocial verbal and other tension reflecting 
pathological behaviors, and of null behavior 
reveals no significant differences in frequency 
of these behaviors associated with the various 
phases of the moon. While not a conclusive 
test of the “lunacy” hypothesis, the analysis 
at least reflects a remarkable degree of con- 
sistency in the behavior of schizophrenic 
patients. 

Data on daily temperature, humidity and 
prevailing weather conditions were also 
gathered and related to several relevant in- 
dices. Again no significant differences sys- 
tematically related to variations in these con- 
ditions were obtained. Two points can be 
made in this connection. The observational 
data were gathered during a period from 
October 1957 to March 1958. The ranges 
of weather variations during this period is 
considerably narrower than the total range 
of climatic variation throughout all seasons 
in the region where the observations were 
made. Had greater extremes of weather con- 
ditions, etc. been included differences in be- 
haviors associated with them might have 
emerged. It is conceivable, on the other hand, 
that chronic schizophrenics are relatively 
oblivious to external weather conditions. In 
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any event the data at hand do not permit a 
conclusive test of this assertion. 

Some as yet unanswerable questions re- 
garding the usefulness of this instrument in 
measuring behavioral change may be raised. 
It is not known how sensitive it is to change, 
ie., how acute the instrument is. The be- 
havioral segments represented by the descrip- 
tive categories may or may not be too gross 
to pick up subtle changes in functioning. 
Consisting as it does, however, of relatively 
specific descriptive categories, change pat- 
terns that do emerge on the LAI may be 
examined in some detail. At the same time, 
the concreteness of the behavioral categories 
tends to safeguard against the possibility of 
observer stereotypes as well as against the 
influence of observer bias when behavioral 
change is anticipated. 

Finally the LAI seems well suited to use and 
application in a variety of neuropsychiatric 
settings with only minor modifications of 
categories to accommodate local conditions of 
geography and stimulus environment. Wider 
application at this juncture would provide a 
means, among other things of exploring sys- 
tematically the effects of different hospital 
environments upon the behavior of schizo- 
phrenic patients. Similarly, the instrument 
might prove of considerable interest in mak- 
ing comparisons between a variety of popula- 
tions. Moreover, the instrument should prove 
useful in studies where a detailed, objective 
description of patient behavior in a natural 
environmental setting is desired. 


SUMMARY 


The Location-Activity-Inventory was de- 
veloped to measure objectively the ward ad- 
justment of neuropsychiatric patients. It is a 
nominal scale, which on the basis of system- 


atic observations provides behavioral profiles 
of patients in the areas of patterns of patient 
location on the ward, characteristic orienta- 
tion to major architectural features, posture, 
and patterns of activities or response to the 
social and nonsocial stimuli on the ward. 
Both intraobserver reliability coefficients and 
split-half reliability coefficients for individual 
descriptive categories are sufficiently high to 
assume a reasonable degree of stability of 
measurement. 

About 9000 patient observations of 100 
deteriorated chronic schizophrenic patients 
obtained with this scale over a period of 6 
months reveal a remarkable dearth of social 
interaction and a preponderance of immobility 
in such patients. 

Psychologically meaningful covariances 
among descriptive categories were observed as 
well as correlations of ward behavior with 
clinical data suggesting that the instrument 
measures more than isolated, unique segments 
of behavior. Applications for use of the in- 
strument in evaluative studies and in a 
variety of settings for comparative purposes 
are suggested. 
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In a 1958 article Rogers presented a the- 
oretical and descriptive formulation of some 
of the process changes in personality and 
personal expression which occur during suc- 
cessful psychotherapy. Since the publication 
of that article a number of investigations have 
been conducted to empirically elucidate and 
operationalize Rogers’ formulation (Hart, 
1961; Rogers, 1959; Tomlinson, 1959; van 
der Veen, 1961; Walker, Rablen, & Rogers, 
1960). 

The first step in this line of research was 
the development by Rogers and Rablen 
(1958) of a scale to quantify the process 
conception of personality. As described by 
Rogers (1959), the rating scale portrays 
aspects of client functioning in therapy along 
a general continuum extending from “rigidity 
and fixity of psychological functioning on the 
one hand to psychological flow and changing- 
ness on the other.” 

For a detailed description of the Process 
Scale the reader should consult the article by 
Walker, Rablen, and Rogers (1960). The 
measure can be briefly characterized, for the 
purposes of this paper, as a_ seven-stage 
standard scale for the rating of typescript 
and/or tape excerpts from therapy interviews. 
The scale as a whole is divided into seven 
vertical subcategories or strands—Feelings 
and Personal Meanings, Experiencing, Incon- 
gruence, Communication of Self, Construing 
of Experience, Relationship to Problems, and 
Manner of Relating. When rating, the raters 
take each strand or subscale into account and 
arrive at a global rating of the client’s process 
level which may range from 1.0 to 7.0. 


1 This investigation was supported (in part) by re- 
search fellowship awards from the United States Of- 
fice of Vocational Rehabilitation, Washington, D. C. 
and by a research grant from the National Institute 
of Mental Health (as part of a larger project), 
Grant No. M-3496. 

2Mimeographed copies of the latest revision of 
the Process Scale may be obtained by sending a 
request to the authors at the Psychiatric Institute, 
University of Wisconsin, Madison, Wisconsin. 


A VALIDATION STUDY OF THE PROCESS SCALE* 


T. M. TOMLINSON anp J. T. HART Jr. 


University of Wisconsin 


This paper reports the latest investigation 
in the series of studies which have been car- 
ried out to determine the reliability and 
validity of the Process Scale. 


HyYPoTHESES 


For the Process Scale to be a useful instru- 
ment for research and assessment it must, in 
the hands of competent judges, yield the 
following kinds of information. First, a scale 
should be able to discriminate, by means of 
obtained ratings, a more successful case from 
a less successful case. 

Secondly, and possibly most important, 
there should be a difference between the ob- 
tained ratings for early as opposed to late in- 
terviews. These differences should be in the 
direction of later interviews yielding a 
higher process rating than earlier interviews. 
Further, the differences should be more pro- 
nounced for interviews from a successful case 
than for interviews from a less successful 
case. 

Carrying the above one step further, if the 
scale is sensitive to the hypothesized changes 
which occur as a function of psychotherapy, 
and if psychotherapy is a reasonably con- 
tinuous phenomenon, then there should be 
differences, though small, between first and 
second halves of an interview. Again the dif- 
ferences should be greater for a more success- 
ful case than for a less successful case. 

The above can be summarized in the form 
of hypotheses as follows: 


1. Process ratings for more successful cases, 
hereafter abbreviated m.s., as rated by mul- 
tiple criteria of outcome, should be signifi- 
cantly higher than process ratings for less 
successful cases, hereafter abbreviated ls. 

2. Process ratings should be significantly 
higher for later interviews than for early 
interviews. 

Corollary 2a: Process ratings should show 
more pronounced differences between early 
and late interviews of a m.s. case than be- 
tween early-late interviews for a l.s. case. 
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3. Process scores will be higher for the 
second half than for the first half of a given 
interview, although this intrainterview dif- 
ference will be smaller than the interinterview 
difference. 

Corollary 3a: Process ratings should show 
more pronounced differences between first 
and second halves of interviews in a mss. 
case than between halves of a Ls. case. 


METHOD 
Cases and Rating Segments 


A sample of 10 cases, considered representative 
of the University of Chicago Counseling Center 
clients, was selected. For each of the 10 cases an 
early interview and a late interview tape recording 
was obtained; the second interview from the begin- 
ning and the next to the last interview. Nine 
2-minute samples of therapy were systematically 
selected from each of the 20 tapes. The sampling 
units were arbitrarily chosen at every 3-minute 
interval within the interview, e.g., 3’interval-2’unit- 
and so on until 
nine units had been selected from each tape. After 
selection these 180 units were coded to disguise their 
origin from the raters, and in addition, the 180 
units were randomly assigned to 18 rating groups 
with 10 rating units per group. For every 2-minute 
unit a tape recording and a typescript of the tape 
recording were given to the judges for ratings. 


Raters 


Two raters were used both of whom had partici- 
pated in previous studies of the Process Scale. These 
two experienced raters rated all 180 units—assign- 
ing one rating to each 2-minute sample. The rating 
period extended over 18 days since each rater rated 
10 of the 180 units per day. The raters made the 
ratings at separate times and were not allowed to 
discuss their ratings with one another—or with 
others. Both raters had previous experience in psy- 
chotherapy and were considered sensitive to 
intherapy events. 


Procedure 


The raters used both tape recordings and type- 
scripts to make their ratings of the 2-minute units. 
Typescripts were necessary because of the poor 
fidelity of some of the tape units. Tape recordings 
were used in addition to the typescripts to pro- 
vide such cues as tone quality, rate of speech, 
emphasis, etc. 

Of the 10 cases selected, 5 were considered more 
successful cases and 5 were considered less success- 
ful cases on the basis of a multicriteria success score. 
The criteria included therapist ratings of outcome, 
patient ratings of outcome, and a self-concept 
Q sort. The ms. and ls. cases were randomly 
assigned to rating groups and all cues in the tape 


TABLE 1 


ANALYSIS OF VARIANCE OF PROCESS RATINGS 


Source df 


Success (S) 
Judges (J) 
Interviews (I) 
Halves (H) 
I 


ann 
XXX 


D2 mt 


xx*xx xX 


x 
Ixs 
Ix 

HXIXJxS 


SS’/Success 
Residual 


1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
x 1 
1 
1 
1 
1 
8 
56 


segments concerning the number of the interview 
or the satisfaction of the client with the therapy 
were eliminated. 

The judges were instructed to rate each unit as a 
separate entity and to try not to relate an earlier 
rating to a later rating. The judges reported no 
influence in their rating due to hearing previous 
segments from the same case, however this was im- 
possible to assess objectively within the present 
design. 


RESULTS AND DISCUSSION 


Using the Pearson method of correlation, a 
coefficient of .65 was obtained between the 
two judges. Considering the complexity of 
the judgmental task this figure was accepted 
as adequate. However, it is not a high re- 
liability figure and the results presented 
hereafter should be interpreted accordingly. 

Before presenting the main body of results 
and discussion it should be pointed out that 
the variables “Interviews” and “Halves” were 
not independent, and the reader should inter- 
pret the findings within this limitation. 

The results then, which are summarized in 
Table 1 are as follows: 


1. Success: The variable success is sig- 
nificant beyond the .01 level in the direction 
of the ms. cases having a higher process 
rating than the l.s. cases and thus supports 
Hypothesis 1. This highly significant differ- 
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ence between success groups supports pre- 
vious results by Walker et al. (1960) and 
Tomlinson (1959), and has proven to be the 
most stable finding in this series of studies. 
This means that the Process Scale is able to 
discriminate, in a highly reliable way, cases 
rated m.s. from those rated Ls. 

2. Interviews: The scale was able to detect 
differences between early and late interviews 
at beyond the .05 level thus supporting 
Hypothesis 2. Process ratings were, on the 
average, higher for the later interview than 
for the early interview. 

3. Interview < Success: Corollary 2a is not 
statistically supported (p> .05), however, 
the result is in the predicted direction. Tenta- 
tively process ratings increase in magnitude 
as a function of success and time of occur- 
rence, e.g., the late, m.s. group received the 
highest mean rating. Table 2 illustrates this 
statement more clearly. 

4. Halves: There is no statistical support 
(p> .05) for the contention that process 
scores will be higher for second than for first 
half of an interview. However, Table 2 shows 
that all means are in the predicted direction 
with one exception, and it can hardly be 
called a major reversal of prediction. 


Judge 1 


T. M. TomitInson AND J. T. Hart, Jr. 


TABLE 2 


MEAN Process RATINGS, NUMBER OF INTERVIEWS, AND COUNSELOR RATINGS OF 
OvuTcoME FoR EAcH or TEN Cases AND Two JUDGES 


5. Halves X Success: Corollary 3a is not 
statistically supported (p> .05), but the 
trend seems clearly supportive of the predic- 
tion. Table 2 shows that the differences: be- 
tween m.s. halves are relatively clear as com- 
pared to the differences between Ls. halves of 
interviews. 

6. Judges: The difference between judges 
is highly significant, (~ < .01). When con- 
sidered in light of the interjudge reliability, 
this result indicates that although judges 
appear to be able to rate up and down the 
scale in a fairly reliable way, they are sub- 
ject to error which makes them consistently 
different in the rating they assign to a given 
segment. It is believed that this differential 
bias is the reason for the significant second 
and third order interactions. Plotting the 
mean process ratings for both judges indicates 
that there is a generally increasing linear 
trend in process ratings as a function of time 
and success, but the magnitude of the inter- 
action depends on which judge makes the 
rating. 


It is not surprising that the difference 
between halves of interviews is not signifi- 
cant. Successful psychotherapy is at least in 
part a function of the number of interviews 


Judge 2 
No. 
of Coun- Early Late Early Late 
inter-  selor — —— 


Case views rating 


2 46 8 2.8 3.2 2.9 3.1 2.6 2.7 2.4 2.6 
3 21 8 3.5 3.8 5.2 5.0 3.9 4.1 4.6 4.8 
+ 25 8 3.4 3.1 4.4 4.8 2.3 2.4 3.4 3.5 
5 14 3.5 3.2 


Mean 


6 28 3 3.2 3.3 3.8 3.9 2.9 2.6 2.8 2.9 
7 27 2 2.7 2.8 2.4 2.6 2.2 2.2 2.4 2.1 
8 27 5 3.4 3.2 3.4 3.5 2.6 2.4 2.5 2.6 
9 26 3 2.7 2.8 2.7 2.4 2.2 2.4 2.5 2.5 
10 38 5 2.5 2.8 2.8 3.0 2.4 2.6 2.5 2.2 
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(Standal & van der Veen, 1957) and it would 
be unusual if within any single interview 
there was a sudden, perceptible personality 
change. For this reason no significant differ- 
ence in mean scores for halves was expected. 
However, if the process of psychotherapy is 
a fairly continuous phenomenon, it would be 
reasonable to expect consistent differences 
between halves, in the direction of the second 
half being higher in process rating than the 
first half. To test this notion a nonpara- 
metric statistical procedure (sign test) was 
used. The differences, holding Success and 
Judges constant, are in the predicted direc- 
tion 13 of 19 times (p = .084) for the early 
interviews and 15 of 19 times (p = .010) 
for the late interviews. Holding Judges and 
Time constant, 16 of the 20 (p= .006) in- 
terviews are in the hypothesized direction 
for the more successful cases while 12 of 18 
(p = .119) are in the positive direction for 
the less successful cases. Thus it seems that 
movement within the interview is both a 
function of time and success. From these re- 
sults it would seem that the process of per- 
sonality change within an interview is directly 
analogous to that which occurs between inter- 
views, i.e., during the course of therapy. The 
only difference is in the magnitude of the 
change. Further, the results suggest that the 
change is a cumulative one whose direction 
and possibly magnitude varies postively as a 
function of success and time. 

If personality change is a function of 
psychotherapy and if the Process Scale is 
sensitive to the behavioral differences that 
occur concomitantly with this change, then 
one would expect these changes to be most 
noticeably reflected in the differences between 
early and late successful interviews. Table 1 
shows this to be the case statistically, but 
the magnitude (See Table 2) of the differ- 
ences is hardly of practical value, being on 
the average only .7 of a stage for Judge 1 
and .3 of a stage for Judge 2. Yet Walker, 
et al., were able to detect mean change scores 
of 1.93 stages in the marked success group 
(N =3) using the same scale. However, 
Walker used cases chosen for the purpose of 
representing end-points of a success-failure 
continuum. The cases used in this study cover 
the entire continuum of success-failure. As 
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such it is reasonable to expect overlap be- 
tween m.s. and ls. cases in the middle of the 
continuum. Further, in analyzing the results 
in terms of means the differences within the 
individual success and/or failure cases are 
washed out. Difference scores were computed 
for each case between early and late inter- 
view ratings and the results suggest that the 
Process Scale detects differences between 
early and late interviews at least in part as 
a function of a success-failure continuum. For 
Judge 1 the differences between early and 
late, m.s. interviews range from 1.5 to .0 
stages, while for Judge 2 the range is .8 to 
—.1 stages. For the ls. group the range for 
Judge 1 is .6 to —.3 stages and for Judge 2, 
.2 to —.1 stages. Thus it appears that the 
Process Scale is sensitive to change as a func- 
tion of success-failure. 

One final comment: This study is the 
latest in a series which have attempted to 
validate the Process Scale. Out of these 
studies one clear finding has always resulted, 
namely that the m.s. case almost invariably 
starts therapy at a higher stage of process 
than the ls. case. In this study, using the 
Mann-Whitney U test to test the significance 
of the difference between early, ms. and 
early, ls. cases, the differences were signifi- 
cant in the expected direction at about the 
02 level (U= 20.5; N,;=10, N2= 10). 
Here as in the previous studies, the ms. 
clients seem to enter therapy at a higher 
stage of process than the ls. ones. This sug- 
gests at least two things: (a) there may be 
some personality characteristics which each 
client must possess in some minimal degree 
to profit from psychotherapy, and (5) those 
clients below this minimum will not progress 
in psychotherapy. This conjecture is similar 
to and supports Kirtner’s (1955) finding that 
the successful client is characterized by dif- 
ferent personality attributes than the client 
who fails to profit from psychotherapy. If 
this approach continues to be fruitful then 
perhaps more light can be shed on the nature 
of and the reasons for the failure case. 


SUMMARY 


The steps which have been taken to ob- 
jectify and validate Rogers’ conception of 
the process of psychotherapy have been sum- 
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marized. The study currently reported, and 
the previous studies justify the following 
conclusions in regard to the Process Scale 
and its operational meaning. 


1. Interjudge agreement is always adequate 
and significantly better than chance, as esti- 
mated by parametric and nonparametric sta- 
tistics. Interjudge reliability appears to have 
a minimum of about .60. 

2. It distinguishes between more successful 
and less successful cases at a high level of 
statistical significance. 

3. It indicates that more successful cases 
begin as well as end at a significantly higher 
level of process. 

4. There is evidence that there is greater 
movement (process change) on the Process 
Scale during the period of therapy in more 
successful than in less successful cases. The 
difference is only modestly significant and its 
utility is yet to be determined. 

5. There is a tendency for the second half 
of each interview to be rated higher on the 
Process Scale than the first half. This tend- 
ency toward intrainterview change is defi- 
nitely significant in the more successful group 
of cases. 


T. M. ToMLINson AND J. T. Hart, Jr. 


REFERENCES 


Hart, J. T. A reliability study of the Process Scale. 
Unpublished MS thesis, University of Wisconsin, 
1961. 

KirtNer, W. H. Success and failure in client-centered 
therapy as a function of personality variables. 
Unpublished MA thesis, University of Chicago, 
1955. 

Rocers, C. R. A process conception of psychotherapy. 
Amer. Psychologist, 1958, 13, 142-149. 

Rocers, C. R. A tentative scale for the measurement 
of process in psychotherapy. In E. A. Rubinstein 
& M. B. Parloff (Eds.), Research in psychotherapy. 
Washington, D. C.: American Psychological As- 
sociation, 1959. Pp. 96-107. 

Rocers, C. R., & Raspren, R. A. A scale of process 
in psychotherapy. Unpublished manual, University 
of Wisconsin, 1958. (Mimeo.) 

StanpaL, S. W., & VAN DER VEEN, F. Length of 
therapy in relation to counselor estimates of 
personal integration and other case variables. J. 
consult. Psychol., 1957, 21, 1-9. 

Tomutnson, T. M. A validation study of a scale for 
the measurement of the process of personality 
change in psychotherapy. Unpublished MS thesis, 
University of Wisconsin, 1959. 

VAN DER VEEN, F. Strand scale analysis of the psy- 
chotherapy Process Scale. Psychiat. Inst. Bull. 
1961, 1, No. 1. 

Watker, A., Rasien, R. A., & Rocers, C. R. De- 
velopment of a scale to measure process change in 
psychotherapy. J. clin. Psychol., 1960, 1, 79-85. 


(Received December 22, 1960) 


4 
ate 
4 
> 
= 
* 
amt 
> 
BE 
4 
14 
4 


Journal of Consulting Psyc 
1962, Vol. 26, No. 1, 79 


hology 
83 


NEED FOR 


SOCIAL APPROVAL AND THE OPERANT 


CONDITIONING OF MEANINGFUL 
VERBAL BEHAVIOR 


DAVID MARLOWE? 
Dartmouth College 


The verbal conditioning paradigm devised 
by Greenspoon (1955) along the lines of 
Skinnerian operant conditioning procedures 
has stimulated a considerable interest in fac- 
tors related to the control of human verbal 
behavior. Recent studies have clearly demon- 
strated that a variety of verbal behaviors can 
be conditioned by operant techniques, that 
various kinds of “social reinforcers” are ef- 
fective, and that conditioning of verbal be- 
havior occurs in the absence of any awareness 
on the subjects’ part as to the “response- 
reinforcement” relationship (Krasner, 1958; 
Salzinger, 1959). While it has been almost 
axiomatic to state that there are marked in- 
dividual differences in susceptibility to verbal 
conditioning, there have been conspicuously 
few reported studies bearing on possible 
personality or motivational correlates of 
verbal conditioning. Studies of the relation- 
ship of anxiety level to conditionability have 
yielded equivocal results (Rogers, 1960; 
Taffel, 1955). The present study was under- 
taken in order to examine the influence of 
“need for social approval” on the condition- 
ing of verbal behavior in a meaningful social 
situation. 

The construct of need for social approval 
and the scale developed to measure this vari- 
able were originally formulated within the 
psychometric framework of a socially desir- 
able response set (Crowne & Marlowe, 1960; 
Marlowe & Crowne, 1961) with the intent to 
account for individual differences in the 
strength of the tendency to respond in a 
socially desirable manner, i.e., in accord with 
situational and cultural demands. Research 
thus far completed uniformly provides sup- 
port for the behavioral properties ascribed to 
the need for social approval variable. Of di- 
rect relevance to the present study is the find- 


1 This study was conducted while the author was 
at the University of Kentucky College of Medicine. 
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ing of Crowne and Strickland (1961) that 
individuals with a strong need for social ap- 
proval emit significantly more plural nouns 
in a verbal conditioning situation than in- 
dividuals less strongly motivated for social 
approval. The present study was designed to 
extend the findings of Crowne and Strickland 
to meaningful verbal communication in an 
interview situation. In accord with the view 
that individuals with a strong need for social 
approval are more sensitive and responsive to 
social reinforcers, it was hypothesized that in- 
dividuals highly motivated for social approval 
would emit more responses in the reinforced 
category than individuals to whom social ap- 
proval is of less importance. In addition, it 
was hypothesized that the high need for ap- 
proval group would show a significant in- 
crease in the reinforced response class com- 
pared to their initial rate of emitting the 
reinforced responses. The latter hypothesis was 
prompted by Rogers’ (1960) inability to 
produce a significant increase from operant 
to conditioned level in the amount of positive 
self-references in a quasitherapy setting sim- 
ilar to that of the present experiment. 


METHOD 
Subjects 


A total of 76 undergraduate college students (34 
males and 42 females) at the University of Kentucky 
volunteered to participate in the experiment. Sixty- 
four subjects were obtained in introductory psy- 
chology classes while 12 of the females were ob- 
tained from an introductory course in nursing. 
Subjects were requested to volunteer for a study of 
“the personal beliefs of college students.” 


Procedure 


The experimental procedure consisted of a single 
15-minute individual interview with each subject. 
The interview was conducted immediately after the 
subject completed the Marlowe-Crowne Social De- 
sirability scale which served as the measure of need 
for social approval. The M-C scale consists of 33 
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items with True and False response categories. An 
illustrative item is, “I never resent being asked to 
return a favor.” 

Each subject was told that the experimenter was 
making a study to learn what college students think 
and feel about themselves, and was instructed to 
describe his personality characteristics and _ traits. 
Following any silence of 20 seconds, the experimenter 
asked one of four questions depending on the con- 
tent of the subject’s verbalizations prior to that 
period of silence. In effect the subject was ques- 
tioned about an area he had not previously discussed. 
The following four questions were used: 


Could you describe yourself in terms of your relation- 
ships with people in general? 

Could you describe yourself in terms of your relation- 
ships with members of your family? 

Could you describe yourself in terms of your social 
life? 

Could you describe yourself in terms of school work? 


Every interview was tape recorded with the sub- 
ject’s knowledge, and later transcribed so that a 
content analysis and reliability check could be car- 
ried out from the complete record. 

Subjects were assigned at random to either the 
experimental or control group without knowledge 
on the experimenter’s part as to the subject’s ap- 
proval score. In the experimental group, every posi- 
tive self-reference was reinforced by the experi- 
menter’s stating “Mm-hm” in a flat monotone. In 
the control group, the experimenter offered no re- 
inforcement, and restricted his comments to asking 
one of the four questions when a 20-second period 
of silence occured. A positive ‘self-reference was 
defined as any statement containing one or more 
references to subject personally as indicated by the 
use of such pronouns as “I,” “My,” “Me,” “Mine,” 
“We,” etc., which served to describe subject in a 
favorable manner. A statement was defined as a 
sentence or complete “thought-unit” after Dollard 
and Mowrer (1947). 


TABLE 1 


MEAN PSR Ratios AND STANDARD DEVIATIONS BY 
PERIOD FOR THE HIGH AND Low NEED APPROVAL 
GROUPS UNDER EXPERIMENTAL AND CONTROL 

CONDITIONS 


Period 
Group 1 2 3 1-15 
Experimental 
High need approval M .181 .233 .166 .194 
(N = 24) SD O75 .094 .095 .064 
Low need approval M .178 130 -146 
(N = 14) SD .086 .079 .058 .062 
Control 
High need approval M 
(N = 18) SD .082 .083 .063 
Low need approval M .203 114 075 .129 
(N = 20) SD 082 067 019 .045 
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FIVE MINUTE PERIODS 
Fic. 1. Mean PSR ratios by period for high and 
low need approval groups under experimental and 


control conditions. 


Immediately following the completion of the 15- 
minute interview, each subject was questioned to 
ascertain the extent to which he was aware of 
the conditioning procedure. The following questions 
were used in this part of the experiment: 


What do you think is the purpose of the experiment ? 

Why did you say what you did? 

Did you pay any attention to me while you were 
talking ? 

Did you notice anything I did while you were 
talking ? 

Why do you think I did that? 

Did it have any effect on what you did? 

What was the effect? 


Reliability for the scoring of positive self-references 
was established by correlating the ratings of the 
experimenter with those independently obtained by 
another scorer who worked from the typed trans- 
scripts with experimenter’s reinforcements omitted. 
A Pearson r of .89 was obtained based on a sample 
of 15 randomly selected interviews. 


RESULTS 


As an initial step in the analysis of the 
data, subjects in the experimental and control 
groups were dichotomized at the overall mean 
M-C score (15.36) to yield the high and low 
need for approval groups. Male and female 
subjects in each of the groups did not differ 
significantly with respect to mean M-C scores 
or rate of emitting positive self-references. 
The final analysis of the data was carried out, 
therefore, on the pooled sample of males and 
females. 

The basic response unit used in the analysis 
is the ratio of positive self-reference (PSR) 
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statements to total statements made. The 15- 
minute interview was divided into three 5- 
minute periods. Mean PSR ratios for each 
5-minute period and for the entire 15 minutes 
were computed for the high and low need for 
social approval subjects in the experimental 
and control groups. Table 1 contains the 
results of this analysis. The mean PSR ratios 
for the 5-minute periods are graphically illus- 
trated in Figure 1. 

In order to test the hypothesis that high 
need for social approval subjects produce 
more positive self-references under positive 
reinforcement than low need for approval 
subjects under positive reinforcement, ¢ tests 
of the differences in mean PSR ratios between 
the relevant groups were computed. For the 
first 5-minute period, no significant differences 
were obtained between the mean PSR ratios 
of any two groups. The maximum difference 
(that between the control highs and experi- 
mental lows) yielded a ¢ value”? of 1.35, 
p = > .05. It would appear then that during 
the first period, the conditioning effect had 
not yet begun to differentiate between the 
groups. The mean PSR ratio of .233 obtained 
by the high approval subjects under positive 
reinforcement during the second period is 
significantly higher (p= < .01) than the 
corresponding mean ratios of each of the 
other three groups. For the last 5-minute 
period, the PSR ratio of .166 obtained by the 
high need for approval subjects under posi- 
tive reinforcement is significantly greater than 
that of the experimental lows (¢ = 1.82, p= 
< .05), and the control lows (¢ = 3.65, p= 
< .01), although not significantly different 
(p = > .05) from the PSR ratio of the con- 
trol highs. When mean PSR ratios for the 
entire 15 minutes are compared, the high need 
for approval subjects in the experimental con- 
dition are significantly (p= < .05) above 
each of the three comparison groups in rate 
of emitting positive self-references. Over-all, 
it may be concluded, as predicted, that in- 
dividuals with a high need for social approval 
emit more positive self-references under posi- 
tive reinforcement than subjects less strongly 
motivated for social approval. 


2 One-tailed tests of significance were computed 
throughout the analysis since directional hypotheses 
were formulated. 


That the differences between the high need 
for approval subjects in the experimental 
condition and the three comparison groups 
are attributable to a conditioning effect, and 
not merely to a decline in the PSR ratios of 
the other groups is indicated by the sig- 
nificant increase (t = 2.48, p = < .05) in the 
PSR ratio of the high approval subjects from 
the first to second period. In contrast both 
control groups evidenced a significant decline 
(p = < .01) in PSR ratios from the first to 
the second period. The decline from the first 
to second period for the experimental lows is 
also significant (¢ = 2.00, p= < .05). The 
significant decline (p = < .01) in the PSR 
ratio of the high approval experimental sub- 
jects during the third period may be likened 
to an “exhaustion” phenomenon. This finding 
is similar to the “exhaustion” of plural noun 
responses obtained by Crowne and Strickland 
(1961) during the later half of their experi- 
mental session. 

For the purposes of the present study, 
“awareness” was defined as an explicit state- 
ment by subject that experimenter said “Mm- 
hm” each time subject made a positive self- 
reference. None of the experimental subjects 
were aware of the “response-reinforcement” 
contingency, a finding similar to those of 
other studies (e.g., Krasner, 1958; Rogers, 
1960). Of the 38 experimental subjects, 27 
reported that they noticed the “Mm-hm.” In 
general, these 27 subjects stated that the 
sound signified “interest” on experimenter’s 
part or “encouragement” to continue talking. 
It may be concluded that changes in the rate 
of emitting positive self-references ocurred in 
the absence of any awareness on subject’s 
part as to the nature of the experimental 
manipulations. 


DISCUSSION 


The results of this experiment add con- 
siderable generality to the frequently reported 
finding that human verbal behavior is subject 
to control through the use of operant con- 
ditioning techniques. Previous interpretations 
of such findings with respect to their implica- 
tions for psychotherapy and interviewing have 
been mitigated by the fact that the task 
typically posed the subjects in past studies 
have seldom required any personally meaning- 
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ful verbal communication. The present find- 
ing, that the frequency with which an in- 
dividual makes positive self-references in an 
interview situation can be modified by min- 
imal social reinforcements without awareness 
on subject’s part, would appear to support 
Krasner’s (1958) contention that process re- 
search in psychotherapy might profitably 
employ operant conditioning techniques. Con- 
ceivably, the change in patient’s verbaliza- 
tions in therapy from negative to positive self- 
references as therapy progresses may reflect 
to some extent, a subtle conditioning effect 
rather than improvement in the patient’s psy- 
chological status. 

The major findings of the present study in- 
dicate that individual differences in condi- 
tionability are in part attributable to differ- 
ences in the strength of the need for social 
approval. As predicted, subjects in the high 
need for social approval group were more 
conditionable as evidenced by the significant 
increase in the rate of responding between 
the first and second periods. In contrast, the 
low need approval group failed to adopt re- 
sponses reinforced by experimenter, as in- 
dicated by the significant decline in the per- 
formance of this group. Similarly, the two 
control groups evidenced a marked decline 
in rate of emitting positive self-references. 
These findings provide a clear-cut confirma- 
tion of data recently obtained by Crowne and 
Strickland (1961) in a study of the influence 
of need for social approval on the operant 
conditioning of plural-noun responses. Over- 
all, the present results lend additional support 
to the construct validity of the need for ap- 
proval variable. Previous research with this 
variable (Crowne & Marlowe, 1960; Marlowe 
& Crowne, 1961; Strickland & Crowne, in 
press) indicates that individuals characterized 
by strong tendencies to do and say the 
socially appropriate thing in order to obtain 
approval are likely to be conforming to ex- 
ternal pressure, and to express attitudes 
congruent with perceived situational demands. 
In this study, the behavior of the high need 
for approval subjects may likewise be inter- 
preted as reflecting the greater sensitivity and 
responsiveness of these subjects to social 
reinforcers as compared to individuals less 
strongly motivated to seek social approval. 
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Instructions to the subjects were deliberately 
brief and nonspecific, requesting only that 
subject “describe his personality character- 
istics and traits.”” Those individuals who are 
most concerned with saying the socially ap- 
propriate thing were most likely to adopt the 
reinforced response. It should be noted that 
only a minor portion of the variance in re- 
sponding with positive self-references is at- 
tributable to the influence of need for social 
approval alone. In the control group, which 
received no reinforcement, an insignificant 
Pearson r of .23 (N = 38) was obtained be- 
tween need for approval scores and the PSR 
ratios over 15 minutes. For the experimental 
subjects, a corresponding r of .42 (p= < .01, 
N = 38) was obtained. Thus, the increase in 
productivity for the reinforced high need for 
approval subjects would seem to reflect their 
greater responsiveness to the reinforcements 
rather than any prior existing tendency to 
emit positive self-references. 

The rather striking and significant decline 
among the experimental highs in the rate of 
emitting positive self-references during the 
third period might reflect a simple exhaustion 
phenomenon in that subjects ultimately run 
out of positive things to say about them- 
selves. An alternative and probably more 
likely explanation is that the decline is due to 
a reluctance on the part of subjects to enu- 
merate repeatedly their positive personal char- 
acteristics. Thus, one high need for approval 
subject who participated in the experimental 
condition spontaneously commented at the 
end of the interview, “It’s awfully hard to 
sit here and toot your own horn.” In line with 
the hypothesis that subjects are self-conscious 
about excessive self-praise is the comment by 
Matarazzo, Saslow, and Pareis (1960) that 
an individual’s pre-experimental history of 
differental reinforcement to particular verbal 
response classes may exert an influence in 
verbal conditioning experiments. It seems 
quite likely that most individuals have de- 
veloped the expectancy that excessive elabora- 
tion of one’s good points is not socially ac- 
ceptable behavior. The decline noted above, 
therefore, might well be due to the individ- 
ual’s prior reinforcement history exerting an 
effect against the specific reinforcements of 
the experimenter once a certain point has been 
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reached in the subjects production of positive 
self-references. 

The present results regarding the condition- 
ability of positive self-references are in con- 
flict with those of Rogers (1960) who failed 
to obtain direct evidence of a conditioning 
effect over a series of six 10-minute quasi- 
therapy sessions. This discrepancy in results 
may be due to the fact that the intervals 
between sessions were of such length as to 
result in extinction of the reinforced responses 
from session to session. This point can only 
be considered suggestive, since Rogers does 
not specify in his report the number of days 
which intervened between each session. 


SUMMARY 


The purpose of this study was to test the 
hypothesis that the conditionability of posi- 
tive self-references in an interview setting is 
related to individual differences in the strength 
of the need for social approval. 

Seventy-six subjects completed the Mar- 
lowe-Crowne Social Desirability scale which 
served as the measure of need for social ap- 
proval and were then individually interviewed 
for 15 minutes under either control or ex- 
perimental conditions. Experimental subjects 
were reinforced by the experimenter’s stating 
“Mm-hm” each time subjects verbalized a 
positive self-reference. Control subjects re- 
ceived no reinforcement. Finally, each ex- 
perimental subject was questioned to deter- 
mine the extent of his awareness of the ex- 
perimental manipulations. Each 15-minute 
interview was tape recorded and later typed. 
High interscorer reliability was obtained for 
the scoring of positive self-references. 

The results obtained indicate that the rate 
of emitting positive self-references in an inter- 
view situation can be altered by operant con- 
ditioning, and that subjects with a strong 


need for social approval produce significantly 
more positive self-references under positive 
reinforcement than a comparable group of low 
need for approval subjects. 

Results were interpreted in terms of their 
relevancy to process research in psycho- 
therapy, and were further seen as contributing 
to the construct validity of the Marlowe- 
Crowne SD scale. 
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This study was designed to yield three 
kinds of information. First, it sought to estab- 
lish the relationship between different sources 
of information about patients’ aggressiveness. 
If the relationships are strong, aggressive- 
ness is sufficiently pervasive to be assessed 
by any one of several means. If the relation- 
ships are weak, aggressiveness is too complex 
and varied to be assessed by a single source 
of information. Second, it related enduring 
aggressive trends to the way hostile materials 
are dealt with in the laboratory. Previous re- 
search has had a similar aim, but this is the 
first study to employ varied personality meas- 
ures of aggression and varied laboratory 
tasks. Furthermore, aggressiveness is divided 
into different modes (assault, indirect, etc.), 
and each mode is related to performance with 
hostile materials. Third, the study sought 
further information on how aggressive be- 
haviors tend to cluster. The Buss-Durkee in- 
ventory (Buss & Durkee, 1957) yielded 
factors of Aggression and Hostility when 
the scores of college students were factor 
analyzed. In the present study the scores of 
patients on this inventory are factor analyzed, 
as well as ratings from other sources of in- 
formation. These factor analyses should indi- 
cate the stability and generality of the two 
factors derived previously. 


MetTHOD 
Subjects 


The subjects were admissions to an acute treat- 
ment center for psychiatric patients. The age range 
was 16 to 60 years, and the diagnoses included neuro- 
sis, psychosis, character disorders, and mixed diag- 


1 Analysis of the data was supported by a re- 
search grant, M-2655 (A), from the National Insti- 
tute of Mental Health, United States Public Health 
Service. 


noses. There were originally 100 subjects, but 4 were 
subsequently discovered to have organic brain dam- 
age, and they were dropped from the study. Of the 
remaining 96 subjects there were 63 women and 33 
men, reflecting the approximately two-to-one ratio 
of women to men admitted to the hospital. When- 
ever there were clear sex differences in the data, the 
men’s and women’s results were treated separately. 


Measures 


Ratings. Each patient’s psychiatrist rated him on 
a seven-point scale for the variables tapped by the 
inventory: Assault, Indirect, Irritability, Negativism, 
Resentment, Suspicion, and Verbal. The psychiatrist 
made his ratings after studying the patient’s social 
and psychiatric history and after at least one 
lengthy interview; test results were not available. 

Each patient’s psychologist made similar ratings, 
using only his observations of the patient’s test 
behavior and the test protocols. 

The closest available relative was interviewed. 
Because relatives are biased and have little experi- 
ence with ratings, they were not allowed to make 
the ratings. Rather, one investigator interviewed 
and later read the protocol to the other two investi- 
gators, and then all three made independent ratings. 
Their intercorrelations ranged from 83 to .96. 
Because of the close agreement the three ratings for 
each category were pooled. 

Tests. The Buss-Durkee inventory was adminis- 
tered to each patient individually. The Rorschach 
was also administered individually, and the Elizur 
(1949) and DeVos (1952) systems were used to 
obtain a hostile content score. The two systems 
were found to correlate .74, which is very high in 
light of the reliability of each system. Therefore 
only the Elizur system was used. 

The Iowa Picture Interpretation Test (IPIT) 
(Hurley, 1955) was also administered individually. 
The form used here consisted of 24 TAT cards, 
with four alternative stories for each picture. The 
subject rated each story for its applicability to 
the picture, and the aggression score was the sum 
of the ratings for the aggressive stories. Because 
low scores equal high aggressiveness and vice versa, 
the correlations between the IPIT and other meas- 
ures were reversed in sign. 

Laboratory tasks. The scrambled sentences tech- 
nique, first reported by Watson, Pritzker, and Madi- 
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TABLE 1 


CORRELATIONS BETWEEN PsyCHOLOGISTs’, PSYCHIATRISTS’, AND RELATIVES’ RATINGS 


Mode of 


aggression 


hologist vs. 
psychiatrist (63) 
Assault 
Indirect 
Irritability .68** 
Negativism .28* 
Resentment an 
Suspicion 46** 
Verbal 07 

Sum 


* Significant at .05 level. 
** Significant at .01 level. 


son (1955), involves a neutral or hostile solution in 
unscrambling four words. Thus “take arm his break” 
can be unscrambled to form “take his arm” or 
“break his arm” when one of the four words is 
omitted. Some of the Watson et al., sentences were 
retained, but most were discarded in favor of 
newly constructed sentences. Thirty sets of words 
were used, and the score was the number of hostile 
solutions. 

The operant conditioning task was a modification 
of that originated by Buss and Durkee (1958). On 
each trial the subject was handed a 3X5 card 
containing a hostile and a neutral word and was 
instructed to make up a sentence using one of the 
words. There were 60 trials in all. On the first 10 
trials the subject made up the sentences without 
reinforcement; subsequently the experimenter said 
“Good” after each hostile response. 


RESULTS 


The results will be reviewed under three 
headings. First, the relationships between the 
inventory and all other measures and tasks 
will be discussed. Next, the relationships of 
the ratings, tests, and laboratory tasks will be 
reported. Finally, factor analyses of the in- 
ventory and the ratings will be reviewed. 


The Inventory 


The inventory scores were correlated with 
each of the other types of measures: ratings, 
tests, and laboratory tasks. 


Ratings. Before inquiring about the re- 
lationship between these three sets of ratings 
and the inventory, it is important to establish 
the relationships among the ratings them- 
selves. The relevant data are presented in 
Table 1. Psychologists’ ratings were available 
on 95 of the 96 subjects, psychiatrists’ ratings 


Psychologist vs. 
relatives (64) 


Psychiatrist vs. 
relatives (43) 


.20 


on 64 subjects, and relatives’ ratings on 67 
subjects. Thus the various sets of correla- 
tions differ in N, and the numbers in paren- 
theses indicate these Ns. 

Concerning the correlations, six of the eight 
psychologist-psychiatrist correlations are sig- 
nificant, while in the remaining two columns 
only 3 of 16 correlations are significant. 
There is more agreement between psycholo- 
gists and psychiatrists than between relatives 
and either psychologists or psychiatrists. 

There appear to be two reasons for the low 
correlations among ratings. First, there were 
many raters and sources of information, and 
it may be assumed that each had his own 
implicit norms. Differences in such norms 
would lead to differences in ratings that would 
appear as unreliability; the result would be 
lower correlations. 

Second, there are probably differences in 
aggressiveness from one context to the next, 
e.g., a patient behaves differently at home 
from the way he does in the hospital. Such 
discrepancies in behavior would lead to dis- 
agreement among raters who evaluate the 
patient’s aggressiveness in different contexts. 
This has consequences for the relationship 
between self-report measures of aggressive- 
ness and the observations of others. If the 
patient differs in aggressiveness in different 
situations, this places a low ceiling on the 
relationship between the inventory and the 
ratings. 

The correlations between the inventory 
and the ratings are presented in Table 2. 
Only one of the psychological ratings was 
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significantly related to an inventory scale: 
Resentment. Since none of the other correla- 
tions in the Psychologist column is as high as 
.10, the Resentment correlation may be 
attributed to chance. 

The psychiatric ratings, on the other hand, 
are related to inventory scores; four of the 
seven inventory scales are significantly re- 
lated to psychiatric ratings. Evidently the 
vantage point of psychiatrists was better than 
that of psychologists, at least in relation to 
the patient’s self-reports. The psychiatrists 
observed patients during interviews, listened 
to their stories, and had access to case history 
materials; they may have placed consider- 
able weight on the patients’ own stories. 

While neither the psychologist-inventory 
nor the psychiatrist-inventory correlations 
showed sex differences, there were sex differ- 
ences in the relative-inventory correlations. 
Therefore the men’s and women’s correlations 
are presented separately in the last two col- 
umns of Table 2. None of the men’s correla- 
tions is significant; the women’s correlations 
are for the most part significant but not high. 

Tests. The correlations between the tests 
and the inventory are presented in Table 3. 
Again there are sex differences; for men 
Elizur hostile content scores are unrelated to 
inventory scores, but for women four of the 
inventory scores are related to the Elizur. For 
the IPIT the sex difference is in the opposite 
direction: significant correlations for men 
but not for women. The correlations between 


TABLE 2 


CORRELATIONS BETWEEN THE INVENTORY 
AND THE RATINGS 


Inventory versus 


Relative 
Psycholo- _Psychi- 

Mode of gist atrist Men Women 
aggression (95) (64) (22) (45) 
Assault 04 .22 .03 14 
Indirect —.09 14 31* 
Irritability .06 .36** .03 .29* 
Negativism .49** —.30 .24 
Resentment 35** .28* —.30 .36* 
Suspicion 04 -20 .16 .30* 
Verbal .03 .48** 07 .30* 
Sum —.03 19 —.17 .33* 


* Significant at .05 level. 
** Significant at .01 level. 


TABLE 3 


CORRELATIONS BETWEEN THE INVENTORY AND 
HostiteE CONTENT SCORES AND 
tHE IPIT 


Inventory versus 


Elizur IPIT 
Mode of Men Women Men Women 
aggression (33) (63) (27) (50) 
Assault 19 .26* .40* 18 
Indirect 21 .23 .28 
Irritability 17 ll —.01 
Negativism 15 .20 36 21 
Resentment .02 .20 .45* 
Suspicion —.01 .30* 49* —.24 
Verbal -12 .37** —.14 
Sum .10 41** .42* 10 


* Significant at .05 level. 
** Significant at .01 level. 


the IPIT and the Elizur hostile content score 
are .18 for men and —.40 for women. 
Laboratory tasks. The laboratory tasks 
were scrambled sentences and operant con- 
ditioning of hostile verbs. There were two 
scores for the latter task, operant level and a 
learning score. The learning score yielded no 
significant correlations with the inventory or 
any other measure, and it is omitted from 
further comparisons. The results for the 
scrambled sentences and operant level are 
presented in Table 4. For men one scrambled 
sentences-inventory correlation is significant, 
while for women all but one are significant. 
The correlations between the inventory and 
operant level show just the opposite results: 
fairly high correlations for men and very 
low correlations for women. The men’s corre- 
lations between the inventory and operant 
level were among the highest of the entire 
study. The sex differences carried through to 
the scores on scrambled sentences and operant 
level, in both instances men scoring higher 
than women. The difference is significant at 
the .05 level for operant level (¢ = 2.02, 
df = 64) but just misses the .05 level for 
scrambled sentences (¢ = 1.89, df = 88). 


Ratings, Tests, and Laboratory Tasks 


The three sets of ratings were correlated 
with the Elizur hostile content score, IPIT, 
operant level, and scrambled sentences. Since 
there were 21 ratings (three sets of seven 
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each) and each rating was correlated with 
four scores (two tasks, two tests), there 
were 84 correlations in all. Approximately 
four of these correlations would be signifi- 
cantly different from zero at the .05 level by 
chance alone, and nine correlations were 
actually significant at the .05 level. Since 
there is no way of knowing which correlations 
are reliably different from zero, the only re- 
course is to search for consistency. If a par- 
ticular scale had significant correlations with 
several variables, its relationships are more 
likely to be stable. Only two scales mani- 
fested consistency: Irritability and Negativ- 
ism, as rated by relatives, significantly corre- 
lated with the IPIT, scrambled sentences, 
and operant level. Thus patients’ Irritability 
and Negativism prior to hospitalization is 
related to measures of aggressiveness. 

Next the tests were correlated with the 
laboratory tasks, with the results kept sep- 
arate. None of the correlations between the 
Elizur hostile content scores and the two 
laboratory tasks is significant. The IPIT- 
scrambled sentences correlations are .60 for 
men and .35 for women. The IPIT-operant 
level correlations are .66 for men and .35 for 
women. 

The scrambled sentences-operant level cor- 
relations are .65 for men and .29 for women. 
The communality among the three measures 
is strong for men, and at least a weak rela- 
tionship is present for women. The correla- 
tions between the IPIT, scrambled sentences, 


TABLE 4 


CORRELATIONS BETWEEN THE INVENTORY AND 
SCRAMBLED SENTENCES AND OPERANT LEVEL 


Inventory versus 


Scrambled 
sentences 


Operant level 


Mode of Men Women Women 
aggression (28) 


Assault 
Indirect 
Irritability 
Negativism 
Resentment 
Suspicion 
Verbal 

Sum 


* Significant at .05 level 
** Significant at .01 level. 


TABLE 5 


Factor LoApINGs FoR PATIENTS’ 
INVENTORY SCORES 


Men (53) 


Women (114) 


Mode of 
aggression I 


Assault .28 
Indirect 35 
Irritability 63 
Negativism 34 
Resentment 63 
Suspicion 52 


Verbal 33 


and operant level may be explained by 
assuming that each represents a base (or 
operant) level of aggressive responding. On 
the IPIT the subject chose from four stories 
to each picture, with one of the choices being 
hostile. On scrambled sentences the subject 
chose between a hostile and a neutral un- 
scrambling of the sentence. On the learning 
task (before reinforcement) the subject chose 
a hostile or a neutral verb in constructing a 
sentence. Thus on all three tasks the subject 
chose between neutral and hostile alterna- 
tives, his score representing a base level of 
aggressive or hostile responding. 


Factor Analyses 


Inventory scores were available not only 
for the 96 patients of the project but for 71 
additional psychiatric patients; with the large 
number of subjects it was possible to keep 
the sexes separate. For the psychiatrists’, 
psychologists’, and relatives’ ratings separate 
analyses were not necessary because the sex 
differences in correlations were small and un- 
systematic. 

The 650 computer was used, and factors 
were rotated to an orthogonal solution by the 
varimax method. In order to make all the 
data comparable, the original inventory data 
on 85 college men and 88 college women 
(see Buss & Durkee, 1957) were again factor 
analyzed and rotated by the varimax method. 
The factor patterns are identical to those of 
the previous study: Factor I is defined by 
Assault, Indirect, Irritability, and Verbal; 
and Factor IT is defined by Resentment and 
Suspicion. This pattern holds for both sexes, 
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TABLE 6 


Factor Loapincs or PsycHOLOGISTS’, AND RELATIVES’ RATINGS 


Psychiatrists (69) 
Mode of 


aggression I Il 


Psychologists (95) Relatives (67) 


I II I II 


Assault — .07 
Indirect 71 
Irritability 81 16 
Negativism 68 17 
Resentment 39 49 
Suspicion 59 
Verbal 82 


48 54 02 
54 33 84 
81 14 77 32 
-10 61 .60 38 
.68 07 
.29 .29 40 
68 22 82 04 


but for women Negativism loads up on 
Factor I. 

The inventory data for patients are pre- 
sented in Table 5. The criterion of meaning- 
fulness is the same as in the previous study: 
loadings of .40 and above. It is difficult to 
discern a pattern in the loadings above .40. 
For men Factor I is defined by Irritability, 
Resentment, and Suspicion; for women it is 
defined by Irritability, Resentment, and 
Verbal. For men all the scales have loadings 
greater than .40 on Factor II, whereas for 
women only four scales have loadings greater 
than .40. Thus for the men Factor II is a 
general factor, including all aspects of ag- 
gression and hostility, whereas for the women 
there is no general factor. However, the 
women’s pattern for Factor II does not seem 
to make much psychological sense, nor does 
it conform to the general factor of the men. 
In brief, the factor analyses of the patients’ 
inventories yield a confusing picture that 
suggests no clear interpretation. 

The factor analyses of the ratings by psy- 
chiatrists, psychologists, and relatives are 
presented in Table 6. For psychiatrists, 
Factor I is defined by Assault, Indirect, Irri- 
tability, Negativism, and Verbal; Factor II 
is defined by Resentment and Suspicion. This 
pattern is identical to the pattern of college 
women taking the inventory. For psycholo- 
gists, Factor I is defined by Assault, Indirect, 
Irritability, and Verbal; Factor IT is defined 
by Negativism, Resentment, and Suspicion. 
This pattern is identical to that of the psy- 
chiatrists’ ratings, with the exception of 
Negativism, which now loads up on Factor 
II. Thus except for Negativism, the factor 


pattern for psychologists’ ratings resembles 
that for both psychiatrist’s ratings of patients 
and college students’ inventories. 

Concerning relatives, Factor I is defined by 
all the variables except Suspicion, which de- 
fines Factor II. This factor pattern differs 
from the others in that Resentment now 
loads up on Factor I, whereas in the patterns 
for psychiatrists, psychologists, and college 
students Resentment combined with Sus- 
picion to define Factor IT. 

These various factor patterns reveal both 
inconsistencies and consistencies. First, Nega- 
tivism appears in Factor I, Factor II, or 
neither, depending upon the source of in- 
formation; Negativism is an unstable scale. 
Second, the factor analyses of the patients’ 
inventories yield ao discernible pattern. Third, 
with the exception of the patients’ inven- 
tories, all sources of information yield a factor 
pattern that is more or less consistent: 


AGRESSIVENESS HOSTILITY 
Assault Resentment 
Indirect Suspicion 
Irritability 
Verbal 


The two factors, Aggressiveness and Hos- 
tility, account for the loadings over .40, 
with the exceptions noted. They show stabil- 
ity across sources of information: self-reports 
of normals and ratings of patients by psychi- 
atrists, psychologists, and relatives all yield 
more or less the same two factors. The factor 
analyses of patients’ inventories did not yield 
the same factorial structure, which may be 
attributed to distortions by the patients in 
describing their own behavior. Thus the evi- 
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dence for the factors of Aggressiveness and 
Hostility is clearly positive. 


SUMMARY AND CONCLUSIONS 


This study yielded several salient findings. 
The ratings by psychologists, psychiatrists, 
and relatives tended to be unrelated or only 
weakly related perhaps because Aggressive- 
ness and Hostility are not pervasive person- 
ality characteristics that can be assessed with 
consistency across situations. A network of 
empirical relationships was established be- 
tween the Buss-Durkee inventory and psychi- 
atric ratings, relatives’ ratings, projective 
measures of aggression, and laboratory tasks 
involving hostile materials. Both tests (IPIT 
and Elizur) were found to correlate with the 
inventory. The IPIT yielded higher correla- 
tions, and in addition it was significantly 
related to both laboratory tasks, whereas the 
Elizur was not. 

Conditioning of hostile material was unre- 
lated to any of the other measures used in the 
study, but the operant level of emitting hos- 
tile verbs was found to correlate with a num- 
ber of measures and was one of the best 
measures of the entire study. The scrambled 
sentences were also related to several other 
measures. Both these laboratory tasks tap 
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some base (operat) level of emitting aggres- 
sive responses, and they may be better meas- 
ures of aggressiveness than ratings made on 
the basis of interviews or a battery of tests. 

Finally, the factor analyses yielded fairly 
consistent factor patterns. The patients’ in- 
ventory scores yielded no meaningful patterns, 
but the psychiatrists’, psychologists’, and 
relatives’ ratings all yielded an Aggressiveness 
factor and a Hostility factor. 
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The clinical literature contains a number of 
reports on the mental status of patients fol- 
lowing cerebral hemispherectomy. In many 
cases clinicians have been struck by the re- 
markable preservation of higher mental func- 
tions following so radical an operation. The 
literature on the subject was summarized by 
Mensh, Schwartz, Matarazzo, and Matarazzo 
(1952), and by Gardner, Karnosh, McClure, 
and Gardner (1955). We had an opportunity 
to interview and to test one such patient and 
would like to report our findings and observa- 
tions. 


CasE History 


The patient, a 39-year-old white man, was 
admitted to Western Reserve University Hos- 
pitals on October 14, 1954. He had been in 
good health until August 1954, when he de- 
veloped headaches which became progres- 
sively more severe. 

On admission positive findings were con- 
fined to severe papilledema, and some loss of 
fine movements of the left hand. Skull films, 
electroencephalogram, and _ ventriculogram 
gave evidence of a right frontotemporal mass. 
On October 18, 1954, a craniotomy was per- 
formed and a tumor was removed from the 
right frontotemporal area. Histopathologic 
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from these analyses and thus we may have un- 
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the projective material. The complete analyses, 
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diagnosis of the tumor was astrocytoma, 
grade four (glioblastoma multiforme). The 
patient had an uneventful recovery and was 
discharged on October 30, 1954, without 
demonstrable neurologic effects, except for a 
left superior quadrant homonymous visual 
field defect. 

The patient was readmitted on November 
25, 1954, for a right hemispherectomy and, on 
this admission, the only change in his exam- 
ination since discharge was a recurrence of his 
papilledema. On November 26, 1954, a right 
hemispherectomy was done. This included the 
globus pallidus, putamen, and _ caudate 
nucleus, and a considerable portion of the 
thalamus. The major arterial supply was 
ligated just distal to its connection to the 
circle of Willis. 

Postoperatively the patient had a left 
hemiplegia, disturbance of body consciousness 
on the left with loss of localization of stimuli 
applied to the left side. During the first few 
days after the hemispherectomy the patient 
refused to acknowledge his paralyzed left side. 
When he was made to touch his left hand or 
leg he insisted the hand and leg belonged to 
someone else. This symptom, encountered 
frequently in patients after damage in the 
right cerebral hemisphere (Weinstein & Hahn, 
1955), subsided after about 3 weeks. During 
the first 3 postoperative weeks the patient was 
deeply depressed, drowsy, and almost lethar- 
gic. Slowly, however, he started to take inter- 
est in his environment. He commenced physi- 
cal therapy, and after 60 days of convales- 
cence, he became ambulatory with a left lower 
leg brace. He was able to localize stimuli to 
the general areas of the left side. Body im- 
age was described normally and he was able 
to differentiate touch, vibration, and pain 
stimuli with 60% accuracy on the left, al- 
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though he stated that all were variations of 
pain. On the seventy-first postoperative day, 
the patient was discharged. 

The patient continued to improve as far 
as ambulation and self-sufficiency were con- 
cerned until 3 weeks before his last admis- 
sion (April 10, 1955). At that time, he be- 
gan to have pain in his right eye, with some 
loss of sensation over the first division of 
the right fifth nerve. Six days before admis- 
sion, he began to have nausea and vomiting. 
On this admission, ventricular tap revealed 
increased intracranial pressure and increased 
protein (800 mg. percent). Right carotid 
arteriography revealed displacement of the 
left anterior cerebral and anterior communi- 
cating arteries, consistent with tumor re- 
growth. A ventriculogram revealed a space 
occupying mass in the right cranial vault. He 
was discharged without therapy and progres- 
sively deteriorated until his death on April 25, 
1955. Unfortunately permission for a post- 
mortem examination of the brain of the pa- 
tient was not granted by his family. 


OBSERVATION OF BEHAVIOR 


Seventy-one days after the hemispherec- 
tomy the patient was discharged from the 
hospital. At home the patient apparently tried 
to overcome his depression by joking about 
his condition and making plans for his future. 
He felt that he could return to productive 
work in his former occupation as a machine 
designer. His wife reported that he did not 
show concern about their precarious financial 
situation. He did not ask how she managed to 
pay for household and hospital bills. On the 
contrary, he was puzzled as to why she had 
to spend endless hours in the kitchen making 
chocolates for a candy store in town, and he 
was annoyed that she did not spend all her 
time with him. 

As he recuperated from the operation, he 
became increasingly preoccupied with sexual 
matters. On his return home he made fre- 
quent sexual advances to his wife but found 
himself impotent. He exposed his genitals 
freely, usually when he was alone in his chair, 
but he was quite unconcerned when someone 


entered the room. He also made sexual ad- 
vances to a female neighbor who was helping 
his wife care for him. He felt very badly at 
“not being a man any more.” He told his wife 
that since he could no longer have sexual 
relations with her, he might as well die. 

During the last weeks of his life the pa- 
tient showed on several occasions striking 
distortions of time sense. After his wife had 
just left the room she would be called back 
by the patient. The patient would cry and 
complain bitterly that he had been left alone 
and unattended for hours. 


PSYCHOLOGICAL TESTING 


We saw the patient for psychological test- 
ing during March of 1955 in the fourth month 
following hemispherectomy. Testing was done 
in his home, and because the patient tired 
easily, testing was spread over four afternoon 
sessions. Five subtests of the Wechsler- 
Bellevue Intelligence Scale (Form 1), the 
Rorschach, the Draw-a-Person Test, and the 
Bender Gestalt test were administered. The 
patient was extremely cooperative and eager 
to help us and science. 

Wechsler-Bellevue Intelligence Scale. Un- 
fortunately we were not able to give to the 
patient the complete test. On the five subtests 
we did administer the patient scored as 
follows: 


RAW SCORE WEIGHTED SCORE 


Verbal scale: 
Information 14 
Similarities 17 
Vocabulary 33 


Performance scale: 
Picture Completion 7 
Object Assembly 8 


If we prorate for the missing subtests we 
arrive at a verbal IQ of 121, and a perform- 
ance IQ of 76. 

Rorschach test. The Rorschach test was 
completed in one session and was recorded on 
tape. The patient’s performance on this test 
gives a better picture of his higher mental 
functions than any other observations we 
could contribute. We, therefore, reproduce 
the protocol in its full length. 
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Subject: R. G. (Male) 
Age: 39 
Date: March 22, 1955 


3” 


1. Well, that might be some coral formation. I (Q: You said it might be some coral formation. 
have never seen any actually except in pictures. Something you might have seen in pictures. Can 
(The subject hands card back; the experimenter you tell me how much of the blot was the coral 
asks the subject to look some more to see if he formation?) 

see anything else there.) 


S: This thing here—was not the coral formation. 
(When we get to it you tell me.) 


2. Well, it could almost be a skull, but the eye is 2. (You said it might be a skull.) Because of these 


missing. (O.K.?) O.K. two empty spaces, looks like the hollows of the 
eyes and the teeth that might have been across here. 
60” (But the whole thing was the skull?) Yeah. (If you 


see anything else when we go along you might tell 
me about it.) 


1. Looks something like a pelvis bone—but it isn’t. 1. (You said something looked like a pelvis bone. 
(Long pause. Q: What else does it remind you What was that?) 

of, anything ?) These two halves here—the two sides of the pelvis. 
And this part here resembles the part where the 
spine is attached to it. 


2. Can’t think of anything. 2. (You said you couldn’t correlate the red stuff. 
(It does not have to be exact or perfect, just Does it look like anything to you now?) No. 
whatever it might remind you of.) (You said the black could look like a bat. Where 


I am trying to correlate that red stuff in there but is that?) 

it doesn’t ring any bell. The red throws it off These two halves would be the wings. These two 

altogether. (What would it be without the red?) little points would be the hooks on the front of the 

Well, that black section looks something like a bat. wings and these are the feet back of the wings by 
which the thing hangs upside down. 


120” 


Were those things created accidentally or by— 
drawn up? 


1. Well that red in the center could be a butterfly. 1. Because these look like butterfly wings but there 


is no true body in it. (Anything else make you think 
of a butterfly ?) 

Pause. (I was wondering if it were gray, would it 
still look like a butterfly?) Well, it’s a pink coral 
color, not gray. (Do you think of a pink butterfly?) 
No, I don’t think the color had much to do with it. 
Shape of it did, though. 


2. And a sea horse in the other red. 


2. (Q: Shape and color of sea horse.) Yes. 


3. Well, this part is the same as in the other pic- 3. Looks like a pelvis—these two sections here, this 
ture. (Q) That lower center section. This part to the center where the spine would come into it. 
here is the same as the other one there. (What Coccyx region. 

does it look like or remind you of?) Well, 4 (You said this could be a femur. Why?) Oh, that’s 
pelvis. Is that correct? (Um hum, anything else? the relationship to the pelvis. That would be one 
Actually there are no real right or wrong an- side or the other. There are two of these, you know. 
swers). As long as it is a pelvis, this could be a (Because of the relationship?) Yes. It’s also long 
femur. (Why is that? You say as long as that’s and thin like a femur, except a femur doesn’t have 
a pelvis?) Well that’s because that’s the relation- this projection on it, normally, that is. 

ship. The bones. (Anything else?) 
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There is another red spot right here. It doesn’t 
make any picture at all. Except the reverse of the 
opposite one. There is a sea horse on the other 
side. (Is this a sea horse too, then?) Yeah, this 
could be a sea horse. Maybe because the color is 
very much related. (How is that?) Well a sea 
horse is about the same color as that kind of a 
pink. (I remember seeing a sea horse in an 
aquarium, but it was pickled. Are they a pink 
color ordinarily?) Yes, coral color. Little deeper 
than a true pink. They cannot be kept alive in 
captivity. (Is that right?) If you had an 
aquarium they wouldn’t live in it. You have 
seen the dead ones. 


150” 
IV 2” 


1. Looks like a bearskin. 
(Anything else?) That’s all. (Hm) Can’t see 
anything else there (the subject keeps card). 


2. This spot looks like a hippopotamus or a 
rhinoceros. (This part down here?) Yes, this head 
part here. D1 

Seems to me I've seen these before somewhere. 


50” 


V 8” 


1. Looks like a skinned rabbit. It’s opened up and 
its ears sticking up and his feet stretched out 
behind him. 


30” 


VI 12” 


It doesn’t make any sense at all. 
(Do you see anything here?) 

No (the subject keeps card). 

1. Except, maybe, it’s a bear skin. 
(Okay ?) Okay. 


60” 
VII 4” 


1. Um, could be a crab. 
(Pause. Q: Remind you of anything else?) No. 
(Long pause. Q: You look like you are about to 
say something. Are you?) 


1. Well the shape of it resembles it and also the 
color. Dark gray-black sort of color. (And that 
makes it look like a skin?) Yeah. 


2. (Then you said a hippo or rhino head?) These 
projections look like the horns on a rhinoceros or 
the general shape here is flat like a hippopotamus. 
There’s a couple of eyes right here. Looking down 
on top of it. 

The subject reacts to a humming noise and remarks: 
That sound you hear is a washing machine. It is not 
my heart. 


(You said this looked like a skinned rabbit with 
ears opened up. Can you tell me a little bit about 
how you see that?) Well, with the legs stretched 
out behind and the two ears up here and the 
other two legs here. (Q: opened?) Yes, split down 
the middle and then opened up and laid on its 
stomach side. (How can you tell?) The relation to 
the feet and head. 


1. Well there’s a foot here, looks like the hind foot 
of a bear and this thing by a stretch of the imagina- 
tion could be one of the front paws. These could be 
the ears here, or one of the ears. (You said on the 
other one something else beside shape. Is there any- 
thing else here?) Coloring resembles it. 


1. (You said this could be a crab. Can you tell me 
how you saw that?) The body and the claws. 
(Here?) Correction please—this white part would 
be the body and these the claws. (Space in center is 
body and top are the claws. Q: Now, how are they 
attached? How much of this are claws?) This part 
right here. (Both sides, or one side?) Both sides. 
Actually, a crab has two more claws in the back 
This one has lost his in a battle someplace but he'll 
grow new ones eventually. 
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2. I was going to say it has a faint resemblance to 2. (Then you said there was a faint resemblance 
a flying squirrel but it actually doesn’t. to a flying squirrel.) Well, with these membranes 
stretched out, they look kind of thin here, like the 
skin would be stretched out. No such thing a flying 
squirrel. Actually it’s a gliding device used to jump 
off one tree and glide to another. (And what made / 
you think of that, this bottom part?) The thinning 
out of the colors here, looks like the membrane 
stretched out against the feet. (How much of the 
squirrel do you see?) Well if the feet were stretched 
out along here, which you don’t see any feet of 
course. (It’s really this shading that gives you the 
impression of skin being stretched.) Yes. 


3. This part looks like a stabilizer on an airplane. 3. The outline here. (Q) Cause it looks like a 


(Bottom detail) stabilizer on an airplane. The shape it’s not a true 
shape of a stabilizer because it doesn’t lend itself to 
185” an air foil section properly. (What is a stabilizer?) 


The wing section on the back of the plane (tail?) 
The appendage. It keeps the back end of the 
plane level with the front. 


2" 


1. Looks like a skeleton of some kind of fish. 1. (You said you saw a skeleton of some kind of 
(D 3) fish. Now where do you see that?) Well these 
little members across here could be some type of 
rib connections. This could be the main body of the 
skeleton. (The blue.) Yeah. This could be maybe the 
head or skull whatever you call it. 


2. That red looks like a flower that is starting to 
open up—this part—not that part (Bottom 
center part?) Yeah. 


2. Looks like the petals of a flower and the colors. 


3. This part looks like that animal that shoots 3. This was an animal, can’t think of its name you 
its tongue out at insects. (D 1) have seen it in movies probably, that shoots out its 
tongue against an insect to capture it. (Chameleon?) 
It’s not a true chameleon. (Gila monster?) No, a 
gila monster is not as long as this. Different shape 
or color too. (Is this the right color?) No, it’s 
strictly the shape of the thing. 


4. This part here looks like the picture I’ve just 4. (Shows VII) This here is the same as this. (The 
seen. This section right in here. (You mean it blue here?) Very much like this bottom part in 
looks like the last ink blot?) Yeah. (Just the that card. (VII) 

middle part you mean here?) Yeah. (Doesn’t it 

look like anything special?) No. 


5. Well, it looks like a couple sails on a wind 5. (Couple of sails on a wind jammer, where are 


jammer. (Center blue) they?) (Blue) Right here. In technicolor. (Q) No, 
(Okay?) Okay. only in the movies. (So actually it’s the shape?) 

Well, they make them a special color on those small 
153” star class sailing vessels. But these type of sails 


would not be suited for that. Square riggers is what 
they call them. (So it’s mostly the shape here?) 
Yeah. The shape and the hollow—fading away of 
color here gives you the impression that it’s full 
of wind. 
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IX 2” Pretty colors 23” 


1. Like the head of a moose. (Top projections) 


2. This is the head of a person here—side view 
of him. With the head and the body. But this 
has no correlation at all (points to center bottom 
detail). 


67” 


1. This insect here—looks like an insect, what 
do you call them, black beetles? (Top detail) 


2. This again looks like a flower. 


3. This could be a sea worm of some type, 
who ... . name I cannot say. 


4. This could be some kind of a spider. (D 1) 


5. If this were a different coler it could be a 
polar bear. 


6. This thing here looks like the weight of a 
centrifugal type governor. (Center detail) 


7. This thing could be a tree right here growing 
up. (Top) 


8. If this body was thinner it would look like a 
praying mantis. It’s too fat for that type of 
bug. How many legs does a praying mantis have? 
(Six) Then it’s an insect. Well, let’s look at it 
(Counts legs). This one doesn’t have six legs. 
(Side brown) 


230” 


1. (You said it looks like the head of a moose. 
What reminded you of that?) These things look like 
the antlers. Then there is a couple of eyes. (These 
little white spots here?) This here, grayish color in 
here could be the nose part of it. 


2. (Then you saw the head of a person. Sort of a 
side view) Right here is the head—here the nose, 
the mouth right in here. There is the body down 
here. It isn’t a true body either. (What do you 
mean?) Well, it’s too squat, not long enough. 
Unless it’s a Japanese type of person, squatting 
down. (You said this has no correlation?) No. 


1. Yeah. These things look like the pinchers of a 
beetle. These could be the legs dangling out back 
there (Q). He’s attacking this thing whatever it is. 


2. (Side yellow) This dark gray part looks like that 
section where the flower grown out of. Those are 
the petals. And the color helps. 


3. Then the sea worm. (Bottom green) 


4. All these little thin sections look like legs. The 
body right through there. 


5. Don’t forget the polar bear, if it was white! 
There’s the head and there’s a small tail on the bear. 
(And these are the feet?) Yeah, curled up under- 
neath him. (You say they are curled up?) Yeah, up 
under his body. 


6. These two balls are weights on the thing and it’s 
rotating around this axis here and it opens as it 
whirls, it opens up it pulls up the center section 
through the shaft that governs the speed of the 
motor by opening and controlling the throttle on 
the carburetor, butterfly valve. (Spinning?) Well 
they are approaching, they are not quite out. Per- 
haps you remember the weights on the old phono- 
graphs. They use them on steam engines too, and 
auto engines. It isn’t the most common type. The 
common type they use on truck engines is the air 
type. It keeps the engine from overspeeding, that’s 
when an engine goes to hell, when it’s overspeeding. 
It just knocks itself out. Vibration tears it down. 
That’s why you should always keep an engine 
under load. 


8. These two prongs here on the front paws or 
claws, whatever you call them. These things up here 
are the legs holding it up. (Q) (Standing up?) 
Yeah. (Side gray) 
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Fic. 1. Patient’s drawing of a person. 


Drawn-a-Person test and Bender test. The 
patient’s drawing of a person and his repro- 
ductions of the forms of the Bender Gestalt 
Test are shown in Figures 1 and 2. 


BLInD ANALYSES OF PROJECTIVE TESTS 


We were faced with the problem of ob- 
taining an interpretation of the projective test 
material unbiased by a knowledge of the 
presence of such gross brain pathology. We, 
therefore, asked three authorities in their re- 
spective fields to undertake blind analyses of 
the test protocols. The only information 
given was that the subject was a 39-year- 
old male. 

The patient’s Rorschach protocol was sur- 
prising in many respects. It was longer and 
more complex than one would expect from 
an individual with such extensive brain loss. 
We are obliged to Marguerite Hertz for a 
detailed blind analysis of the Rorschach 
protocol. Only a few lines taken from Hertz’s 
analysis shall be quoted here: 


The subject functions at a most mediocre level. 
While his thinking falls within the limits of ac- 
curacy there is evidence of considerable weakness 
in reality testing. Trends towards fluidity and 
arbitrariness of percepts and ideas appear too fre- 
quently in so short a record. The patient is con- 
cerned with bodily status, sexuality, potency. He 
seems filled with anxiety ridden fantasies of being 
torn, mutilated and/or deformed. Potentiality for 
feelings of unreality have been indicated. He shows 


strong potentiality for bizarre departures from 
realistic thinking and conventional logic. While sub- 
ject may show a superficial semblance of being 
organized, he is schizophrenic-like in his handling 
of relationships and in his methods of coping with 
the complexities of reality. Paranoid features are 
likewise dominant in the record. Finally, there are 
many indications of organic impairment that should 
be investigated. 


A blind analysis of the patient’s drawing 
of a person was performed by Rochelle 
Wexler, and with her permission we present 
an excerpt from this analysis: 


Regarding his intelligence, his delineation of such 
sophisticated forms as the trapezoidal ears, as well 
as the whole “feel” of the drawing, suggests a man 
of good intelligence. He is so neat, so carefully or- 
ganized, and his techniques can be carried out with 
such consistency that it would not surprise me to 
learn that he is working effectively. He achieves an 
almost too perfect mechanical integration. However, 
he achieves this integration in a highly arbitrary 
manner. He gets along in a mechanical, robot- 
like fashion. He has worked out his mode of adjust- 


Fic. 2. Patient’s reproductions of the forms of the 
Bender Gestalt test. 
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ment in such a rigid manner that any deviation 
would severely threaten his equilibrium; his defenses 
are so brittle, and his present adjustment is so 
tenuous that he could easily “go to pieces” under 
any stress. 


Wexler notes many immature and childish 
features, major difficulties in the area of 
handling aggression, paranoid ideation, feel- 
ings of sexual impotence, tremendous feelings 
of personal inadequacy, depressive features. 
She feels that many features of the drawing 
“strongly suggest schizophrenia.” However, 
“if he is schizophrenic, I would suspect that 
the pathology has an underlying organic 
basis.” 

The drawings of the Bender Gestalt Test 
were examined by Gerald R. Pascal. In his 
analysis, Pascal notes among other things “a 
stickiness and perseverative quality to Design 
1 which may or may not be due to pervasive 
organic disorder” but then again he labels 
“organicity” as only a “remote possibility,” 
and he feels that “we are dealing with an 
individual whose manifested behavior is 
mostly in contact but who is suffering from 
mood swings with underlying aggression, dis- 
trust, some suspiciousness of the environment, 
and whose obsessive-compulsive defenses 
seem to be no longer efficient. . . .” 


DISCUSSION 


We met the patient in the fourth month 
following hemispherectomy. We were sur- 
prised to find a man who could discuss his 
condition in a seemingly detached and objec- 
tive way. Our surprise was shared by most 
other people who met the patient. Had we 
had to base our judgment only on the per- 
sonal contacts we had with the patient we 
would have stated that his higher intellectual 
functions survived hemispherectomy unim- 
paired. Examination of the test results, how- 
ever, revealed a different picture. While some 
functions appeared remarkably preserved, 
others were markedly impaired. 

Many authors have commented on the 
“concreteness” of brain injured patients. Such 
patients are said to live in a world of concrete 
objects rather than in a world of abstract 
ideas. Such concreteness was not obvious in 
this patient. His performance on the Similari- 
ties test was superior (IQ-equivalent 128). 


Most of his definitions on the Vocabulary 
test were abstract. Objects were defined in 
terms of their membership in a wider class 
rather than in descriptive terms. 

Another indication of the “loss of the ab- 
stract attitude” is seen in the alleged in- 
ability of brain injured patients to plan their 
activities. If one considers the short term 
planning that can be observed in conversa- 
tions or on psychological tests, loss of plan- 
ning ability was not evident in this patient. 
Explanations the patient gave were orderly, 
one step in the explanation prepared the next 
step, and so on. An example of this is his 
explanation of the functioning of a centrifugal 
governor seen on Card X of the Rorschach. 
If, on the other hand, we consider long range 
planning, the patient showed signs of impair- 
ment. We refer to the observation reported 
above concerning the patient’s disinterest in 
and lack of understanding of his financial 
situation, and also his lack of concern for the 
possible social consequences of his changed 
sexual behavior. He had become immodest 
in the presence of neighbors, and he did not 
seem to care whether they disapproved of it. 

The perceptual functions of this patient had 
definitely suffered. Before his disease the 
patient had worked as a machine designer, 
and his last employer told us that the patient 
had done work which called for inventiveness 
and imagination. It appears inconceivable 
that at the time the patient designed machine 
parts, he would have performed as poorly on 
the Object Assembly test or the Picture Com- 
pletion test as he did when we tested him. 
For example, in putting together the manikin 
of the Object Assembly test, the patient re- 
versed the position of the two legs of the 
manikin without becoming aware of it, and 


‘on the Picture Completion test he identified 


the missing part only in 7 out of the 15 pic- 
tures. This failure cannot be attributed to the 
patient’s hemianopsia as the pictures of this 
test are small enough to have been seen by 
the patient in their entirety. In the macular 
area patient’s vision extended 2 to 3 degrees 
around the fixation point, and the patient had 
a corrected acuity of 20/20 in both eyes (Volk 
& Bruell, 1956). On the positive side, the 
patient’s performance on the Bender Gestalt 
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Test must not be overlooked. The patient’s 
drawings were not disorganized, and in the 
blind analysis of these drawings organicity 
was considered a “remote possibility” only. 
Thus, as in the area of abstract behavior, 
we have here this peculiar mixture of loss 
and preservation of functions. 

The situation is similar when we consider 
the recall functions of the patient. On the 
one hand, we have the superior performance 
of the patient on the Vocabulary test (IQ 
equivalent 128), and the rich vocabulary 
displayed in the Rorschach protocol. By con- 
trast, we find that on the Information test 
the patient scored just average (IQ equiva- 
lent 106). We were struck particularly by 
the lack of balance in the things he knew 
and did not know. Thus he would state that 
the airplane was invented by “Michelangelo 
and more recently by the Wright brothers,” 
but he would estimate the population of the 
United States to be five million people. The 
patient’s performance on the Rorschach also 
indicated some weakness in memory func- 
tions. He failed to recognize cards he had 
seen before, and then again he believed to 
have seen cards already which were shown 
to him for the first time. 

Each of the three analysts of the projec- 
tive test material considered the possibility 
of organic impairment, but each also noted 
features reminiscent of paranoia. The ques- 
tion arises whether the schizophrenia-like 
features of the patient’s performance can 
be attributed to hemispherectomy. Unfortu- 
nately, this question cannot be answered 
with any degree of assuredness in the case 
of this particular patient. Close relatives of 
the patient including his mother and a 
brother had gone through schizophrenic epi- 
sodes or had been hospitalized for schizo- 
phrenia, and there are also all indications 
that the patient’s premorbid adjustment was, 
to say the least, schizoid. We would suggest 
that possibly the shock of the operation had 
activated in him latent schizophrenic ad- 
justment tendencies but we would hesitate to 
relate schizophrenia-like features of his be- 
havior to the removal of brain tissue. On the 


contrary, we would point out as remarkable 
that adjustment mechanisms mindful of 
schizophrenia which had existed preopera- 
tively had survived hemispherectomy. 


SUMMARY 


The case of a 39-year-old white male is 
reported whose right cerebral hemisphere was 
removed for tumor. Five subtests of the 
Wechsler-Bellevue Intelligence Scale, the 
Rorschach, the Draw-a-Person Test, and the 
Bender Gestalt Test were administered post- 
operatively. Test results are reported and 
discussed. The case confirms earlier findings 
of a remarkable, but by no means complete, 
preservation of higher mental functions fol- 
lowing cerebral hemispherectomy for tumor. 
The patient showed a perfect retention of 
language, and he had no obvious difficulties 
in forming abstract concepts. Short term 
planning was preserved but there were signs 
of impairment of long range planning ability. 
There were also indications that the memory 
for remote and recent events had suffered. 
Perceptual functions were definitely altered. 
A marked loss of social inhibitions in the 
sexual adjustment of the patient was evident. 
Because of the schizoid premorbid adjustment 
of this patient, it was impossible to say 
whether some of the schizophrenic-like fea- 
tures in his postoperative behavior were a 
consequence of the operation or whether they 
persisted in spite of it. 
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THE FIRST FACTOR OF THE MMPI: 


SOCIAL DESIRABILITY OR EGO STRENGTH? 


ALLEN L. EDWARDS anp LOUISE B. HEATHERS 


University of Washington 


Kassebaum, Couch, and Slater (1959) have 
reported the results of a factor analysis of 32 
MMPI scales. They interpret their first fac- 
tor as “Ego-Strength vs. Ego-Weakness.” No 
reference is made to Fordyce’s (1956) inter- 
pretation of the first factor of the MMPI in 
terms of social desirability nor was Edwards’ 
(1957) Social Desirability (SD) scale in- 
cluded in the analysis. The present study is 
concerned with the relationship between the 
first factor loadings of 31 of the 32 MMPI 
scales, as given by Kassebaum, Couch, and 
Slater, and the correlations of these scales 
with the SD scale.* 

Correlations of 19 of the MMPI scales with 
the SD scale were available from other sources 
(Edwards, 1957, 1961). The remaining 12 
correlations were obtained using the Merrill 
and Heathers (1956) sample of 155 males. 
Figure 1 shows the first factor loadings of the 
scales plotted against the correlations of the 
scales with the SD scale. The product-moment 
correlation between the factor loadings and 
the SD correlations is —.985. It is obvious, 
then, that the magnitude and sign of the first 
factor loadings of these MMPI scales could 
have been quite accurately predicted from the 
zero-order correlations of the scales with the 
SD scale. 

To obtain a high score on the SD scale, a 
subject must give a relatively large number 
of socially desirable responses to the items 
comprising the scale. A socially desirable re- 
sponse is defined by Edwards (1957) as a 
True response to an item with a socially de- 
sirable scale value or a False response to an 
item with a socially undesirable scale value. 


1A list of the scales is given by Kassebaum, 
Couch, and Slater. One of the 32 scales, the Social 
Presence scale, was not included in our analysis be- 
cause of an apparent discrepancy in the scoring key 
used by Kassebaum, Couch, and Slater and the scor- 
ing key for this scale as given to us by Harrison 
Gough. 
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Edwards, Heathers, and Fordyce (1960) and 
Edwards (1961) have shown that the corre- 
lation of an MMPI scale with the SD scale 
is a function of the percentage of items in 
the MMPI scale which are keyed for socially 
desirable responses. For the 31 scales of the 
present study, the product-moment correla- 
tion between these percentages and the first 
factor loadings is —.932. 

To interpret the first factor as “Ego- 
Strength vs. Ego-Weakness” may be more 
satisfying from the viewpoint of personality 
dynamics, but, on the basis of the evidence 
we have reported, we believe it more reason- 
able to interpret the factor as “Social Desir- 
ability vs. Social Undesirability,” that is, as 
representing the tendency to give socially de- 
sirable vs. the tendency to give socially un- 


desirable responses to personality items of 
the True-False type. Regardless of which in- 
terpretation one chooses, the fact remains 
that the SD scale provides an objective and 
easily obtained measure of the factor. 
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Fic. 1. Relationship between first factor loadings 
of 31 MMPI scales and correlations of the scales 
with the SD scale. 
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AGE AS A VARIABLE IN AVERSION TOWARD 
FOOD AND OCCUPATIONS 


HENRY CLAY LINDGREN 


San Francisco State College 


We are all familiar with the person who 
“knows what he is against, but doesn’t know 
what he is for.” Such an attitude is usually 
considered characteristic of at least a minor 
degree of emotional disturbance. One aspect of 
this negativistic attitude was measured by 
Wallen (1948), among others, who found that 
servicemen discharged on psychiatric grounds 
reported a greater number of food aversions 
than did a control group. 

It is the purpose of this paper to explore 
some of the relationships between age and 
negative choice and to show that the pro- 
pensity toward making negative choices, 
albeit characteristic of certain neuropatholog- 
ical syndromes, is also characteristic of in- 
dividuals in their late teens, as contrasted with 
adults in their twenties and older. The hy- 
pothesis to be tested is that younger people 


would reject more foods than older people, 
and that this age-related pattern of rejection 
would apply to occupations, as well as to 
foods. 


Two questionnaires were prepared, the one 
consisting of a list of foods, and the other 
of occupations.' The food questionnaire in- 
cluded the 28 foods used in studies by Altus 
(1949) and Wallen (1948), supplemented 
by 11 others, selected because of their rela- 
tively rare appearance in the standard middle- 
class, American diet and added for the purpose 
of increasing the reliability of the question- 
naire. The occupations consisted of 25 male 
and 26 female jobs, selected for their tedious- 
ness, low status, or unpleasant connotations 
or associations. 


1 Questionaires have been deposited with the 
American Documentation Institute. Order Document 
No. 6939 from ADI Auxiliary Publications Project, 
Photoduplications Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 
for microfilm or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 
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The questionnaires were administered to 
392 college students, who were asked to in- 
dicate which foods they disliked so much they 
would refuse to eat them, and which jobs 
(appropriate to their sex) they would not do 
under any circumstances. Scores were com- 
puted by counting the number of aversions. 
The reliability of the food questionnaire 
proved to be .84; reliabilties for the male and 
female portions of the occupations question- 
naire were .84 and .86, respectively. 

The mean number of food aversions for 
men 21 and under was 7.16; with a standard 
deviation of 4.12, while for men over 21, the 
mean was 4.41, with a standard deviation of 
2.95. Women 21 and under had a mean of 
6.72, standard deviation of 3.60; women over 
21 had a mean of 4.45, standard deviation of 
3.28. The differences between the mean food 
aversions reported by each age group were 
significant at the .001 level. 

The mean number of occupation aversions 
for men 21 and under was 5.70, with a stand- 
ard deviation of 4.78; for men over 21, the 
mean was 4.08, with a standard deviation of 
4.07. The difference between means was sig- 
nificant at the .05 level. Means and standard 
deviations for women were 6.61 and 4.31 
for women 21 and under, and 4.88 and 4.39 
for women over 21, the difference between 
means being significant at the .01 level. Al- 
though women tended to reject more occupa- 
tions than did men, this difference must be 
considered in light of the fact that women had 
a different list of occupations to check. 

Presumably, if both food and occupation 
aversions were a function of a more general- 
ized aversive attitude, scores made on the two 
questionnaires would be positively correlated. 
Product-moment correlation between the two 
questionnaires for the younger men was .48, 
significant at the .001 level: for the older 
men it was .22, significant at the .05 level; 
for the younger women it was .23, significant 


Bre 
= 
| 
| 
ive 
y 
= 
= 
+ 
4 
= 
\ > 


102 


at the .05 level; and for the older women it 
was .28, significant at the .01 level. 


SUMMARY 


College students 21 and under tended to 
indicate dislike for more foods and more oc- 
cupations than did students over 21. The 
tendency to reject foods was positively cor- 


HENRY CLAY LINDGREN 


related with the tendency to reject occupa- 
tions. 
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BRIEF REPORTS 


PSYCHOLOGICAL REACTIONS TO INFANTRY BASIC TRAINING’ 


PAUL EKMAN 
University of California School of Medicine 
WALLACE \V. FRIESEN 


University of Kansas 


DANIEL R. LUTZKER 
Fort Dix, New Jersey 


One of the working assumptions of military 
psychiatry is that basic training has a beneficial 
psychological impact on the typical recruit. The 
plausibility of this hypothesis was explored by 
administering the MMPI to recruits selected 
from the first, fourth, and eighth weeks of basic 
training. 

Ninety-three active army personnel having 12 
years of education and between 18 and 22 years 
of age served as subjects. All of the subjects 
were drawn from three basic training companies 
at one military post. Three separate groups of 
recruits were tested, with men from each of the 
companies selected at each of the testing periods: 
28 men were tested from companies in the first 
week of training, 34 different men from the 
fourth week, and 31 additional men from the 
eighth week. 

The MMPI profiles without K correction for 
the mean scores obtained by the first, fourth, 
and eighth week groups are shown in Figure 1. 
Apart from D, Hy, and Mf, there was a consist- 
ent increase in the clinical scales from first, to 
fourth, to eighth week. A ¢ test was used to 
evaluate the significance * of the differences be- 
tween the means for the three groups on the 
MMPI main scales, the Pd, Ma, and Pa sub- 
scales, the ego-strength scale, the recidivism 
scale, and the pathology scale (Dahlstrom & 
Welsh, 1960). There were no significant differ- 


1An extended report of this study may be ob- 
tained without charge from Paul Ekman (University 
of California School of Medicine; San Francisco 22, 
California) or for a fee from the American Docu- 
mentation Institute. Order Document No. 6875 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress; Washington 25, 
D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to: 
Chief, Photoduplication Service, Library of Congress. 

2 Two-tailed probability tests were either at .05 or 
01 levels of confidence. 


ences between the first and fourth week groups. 
The eighth week group obtained higher scores 
than the fourth week group on Pa 1 (ideas of 
external influence, p< .01), Ma 1 (amorality, 
p < .05), and recidivism (p < .01). The eighth 
week group had a lower score than the fourth 
week group on K (p < .05). The eighth week 
group’s scores were higher than the first week 
group on F (p<01), K (p< .01), Hs (p 
<.05), Pd (p< 05), Ma (p< 01), Pd 4 
(alienation, p < .05), Pa 1 (p< .01), Ma 1 (p 
< .01), Ma 4 (ego inflation, p < .01), recidivism 
(p < .01), and pathology (p < .01). The eighth 
week group’s score on K (p< .01) was lower 
than the first week group. 

There was no increase in scores on ego- 
strength, or any other evidence of beneficial psy- 
chological effects accruing from basic training. 
The change in the shape of the profiles suggests 
that aggressive, impulsive, and energetic features 
became slightly more prominent. The most cau- 
tious explanation of this change would be that 
the subjects were more willing to admit mild 
kinds of antisocial behavior in the eighth week, 
since they had finished their training, and per- 
haps also experienced some relief at completing 


Eighth Week 


Fic. 1. Mean MMPI profiles without K correction 
for the first, fourth, and eighth week groups. 
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an arduous task. The changes on the subscales 
imply that more callous attitudes, a tendency to 
ignore the needs of others, and feelings of self- 
importance increase slightly during basic train- 
ing. The recruits appear less prone to examine 
their own responsibility for conflicts, and more 
ready to react aggressively. 

The fact that the eighth week group obtained 
a mean score on the recidivism scale close to that 
reported for men with multiple AWOLs, and a 


Fisher and Cleveland (1958) have evolved a 
Rorschach content scoring system they relate to 
body image. They categorize responses having 
vulnerable boundaries (e.g.. mashed bug) as 
Penetration (P) and those with firm boundaries 
(e.g., cocoon) as Barrier (B). Across-groups, 
empirical evidence supports these as meaningful 
categorizations. Regarding their possible use in 
longitudinal studies, the present study assesses 
stability of B and P over time with normal sub- 
jects in a relatively constant environment. 

The subjects were 20 males, hospitalized with 
long term physical disorders but screened for 
absence of psychiatric disorder. All were tested 
twice 2 months apart by the same experimenter, 
using Beck’s instructions (Schneider, 1955). The 
40 protocols, arranged randomly, were scored for 
B and P by two raters working independently.* 
To correct for correlation with response total, 
scores from the longer record of each subject 
were reduced in accordance with the ratio of his 
two response totals. 

Both raters learned the scoring system via 
Fisher and Cleveland’s text and had not con- 
ferred about scoring problems; so a check on 
interrater agreement seemed appropriate. Pearson 


1 An extended report of this study may be ob- 
tained without charge from Paul G. Daston (De- 
partment of Psychology, University of Maryland; 
College Park, Maryland) or for a fee from the 
American Documentation Institute. Order Document 
No. 6937 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 
for microfilm or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 

2We are grateful to William Groman, Duke 
University, for his assistance. 
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STABILITY OF RORSCHACH PENETRATION AND BARRIER 
SCORES OVER TIME* 


PAUL G. DASTON ann OWEN L. McCONNELL 


Duke University 


mean score on the pathology scale higher than 
that reported for neurotic veterans, raises some 
question as to the validity of these two scales. 
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r’s between raters were .84 for B and .61 for P. 
The P discrepancy required further investigation. 
When the two raters scored a new record to- 
gether, they diverged on implied vs. explicit 
Penetration. After differences in interpretation 
were clarified, the raters scored another set of 
48 records, with P agreement rising to an r of 
.79, certainly more acceptable. 

Examining stability, test-retest r’s were for B 
and P respectively: Rater A, .90, .82; Rater B, 
.88, .79.5 Differences between means for related 
samples were not significant for either rater on 
B or P scores. However, both raters found P 
scores to decrease on retest (almost at .05 level 
for Rater A). Since mild psychological changes 
probably occurred in the subjects between tests 
(e.g., improvement in health), P may be a more 
sensitive indicator of minimal change than B. 

Overall, the Fisher-Cleveland scoring system 
seems sufficiently objective and stable under the 
conditions employed to be of value in assessing 
effects of various treatment conditions over time. 
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Fisoer, S., & Creveranp, S. E. Body image and 
personality. Princeton: Van Nostrand, 1958. 
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vals on the reproduction and recall of Rorschach 
responses on retest. Unpublished doctoral disserta- 
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3 Sidney Cleveland and Seymour Fisher were also 
kind enough to score these records for B and P. They 
minimized effects of differential response productivity 
on a card by card basis, after which they calculated 
r’s. Test-retest r for B was .93, for P, .90, which 
can be taken as independent evidence for the stability 
of the scoring system over time. 
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A COMPARISON OF 


THE SHIPLEY-HARTFORD AS A POWER 


TEST WITH THE WAIS VERBAL SCALE ' 


H. J. WAHLER ano LUKE S. WATSON 


University Healih Center, Ohio State University 


In many applied settings there is practical in- 
terest in efficient screening tests of intellectual 
functioning. One test frequently used has been 
the Shipley-Hartford. Correlations between scores 
on this test and the Wechsler-Bellevue have 
ranged from .69 to .79 as reported by Garfield 
and Fly (1948), Sines (1958), and Wright 
(1945). These authors used standard administra- 
tion with 10-minute time limits per subtest and 
did not control time span between administra- 
tions of the two tests. 

The purpose of this study was to determine 
the correlation between the WAIS Verbal IO 
and the two S-H subtests, singly and in combina- 
tion, when the S-H is given without time limits. 
Two samples were employed and the multiple 
regression weights for the S-H Vocabulary and 
Abstract Reasoning scores double cross-validated. 
Other relations investigated included WAIS 
Verbal IQ with S-H subtest scores combined 
with unit weights, the WAIS personally adminis- 
tered Vocabulary with the S-H self-administered 
multiple-choice Vocabulary, and the number of 
S-H multiple-choice vocabulary items attempted 
relative to WAIS Verbal IQ. 

Two samples of white adults seeking help at a 
mental hygiene clinic and rehabilitation center 
were tested by two experimenters. The first 
sample consisted of 50 subjects—30 females and 
20 males. The second sample was comprised of 
23 females and 32 males. The WAIS Verbal 
Scale and S-H were administered during the 
same testing session in all cases. 

The S-H total score (V+ A) correlated .88 
with WAIS Verbal IQ for Sample 1 and .87 for 
Sample 2. Multiple R for the two S-H subtests 


1 An extended report of this study may be obtained 
without charge from H. J. Wahler (University 
Health Center; 320 West Tenth Avenue; Columbus 
10, Ohio) or for a fee from the American Documen- 
tation Institute. Order Document No. 6938 from ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress; Washington 25, D. C. 
remitting in advance $1.25 for microfilm or $1.25 for 
photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress 


with WAIS Verbal IQ was .88 for both samples. 
Whether S-H Vocabulary and Abstract scores 
were combined with unit or optimum linear 
weights, over 77% of the variance in WAIS 
Verbal scores was accounted for by variance in 
the combined Shipley scores; this was true for 
both samples. When } weights derived from the 
multiple regression equation for the first set of 
data were applied to scores of the second sample, 
R was .878; with b weights reversed, R was 
.877. The two methodologically different vocabu- 
lary tests correlate consistently high with r’s of 
85 and .86 for Samples 1 and 2, respectively. It 
is also noteworthy that brighter people tended to 
attempt more S-H Vocabulary items than did 
duller subjects as reflected in the significant cor- 
relations between WAIS Verbal IQ and number 
of items attempted: r= .54 for Sample 1 and 
r = .53 for Sample 2. 

When a double entry table based on six cate- 
gories from the WAIS manual was used, 59% of 
the subjects were classified the same with S-H 
IQ estimates as with their WAIS Verbal IQs. 
Seventeen percent of the cases were overestimated 
and 20% underestimated by one category. Only 
6% were missed by two categories. With a 
dichotomous classification of above or below 
average, S-H estimates correctly classified 87% 
of the subjects. 

When IQ is a minor variable in a decision com- 
plex and only a broad categorization is required 
for the circumstances, the findings support the 
position that a Shipley estimate can be sufficient. 
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Therapy process research has been hampered 
by poor rating reliability for the variables under 
study, particularly when ratings are based on 
short units of interaction. One difficulty may be 
that the variables have complex and subtle re- 
lations to concrete therapist behaviors. Thus 
one might logically analyze a concept into sim- 
pler, easier to measure components. Another 
approach is to compare rating conditions em- 
pirically to see which leads to the most reliable 
results; but reliability may be bought at the 
price of validity—in some cases reliability goes 
up as less information is given to the rater. Simi- 
larly, when ratings are averaged over raters the 
result may be a reliable score with what could 
be called a common denominator character. We 
may lose the fineness and texture of individual 
sensitivities. This paper describes an attempt to 
avoid these difficulties, using “therapist com- 
mitment” as the variable. 

A total of 385 therapist statements were 
abstracted from interview transcripts and 
mounted on 3 X 5 cards. One experienced and 
sophisticated therapist rated each statement on 
“degree of commitment,” using a four alternative 
scale. He rated 285 cards in the sequence in 
which they were originally spoken (higher in- 
formation condition) and 100 cards in random 
sequence (lower information). Four graduate 
students in clinical psychology were then taught 
to postdict these (criterion) ratings by a flash- 
card procedure: The student read a statement, 
guessed what the criterion rating must be, then 
turned the card over—the correct criterion 


1An extended report of this study may be ob- 
tained without charge from Robert Ryder (Depart- 
ment of Psychology, University of Michigan; Ann 
Arbor, Michigan) or for a fee from the American 
Documentation Institute. Order Document No. 6935 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress; Washing- 
ton 25, D. C., remitting in advance $2.00 for 
microfilm or $3.75 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 


TEACHING RELIABLE RATING OF A PROCESS VARIABLE * 


ROBERT RYDER 


University of Michigan 


rating was on the back of the card. Training 
was continued for eight sessions, one per week, 
each session being followed by a cathartic dis- 
cussion session, which was one technique used 
to maintain high motivation. Other techniques 
were in the social atmosphere that was main- 
tained (the study was referred to as “the devel- 
opment of a process variable,” not: as a rating 
study, the students were called “participants,” 
not raters, and a girl was hired to serve them 
coffee and otherwise emphasize their impor- 
tance), and in wages paid—four dollars per 
hour. Participants were instructed to not men- 
tion particular statements in the discussion 
sessions. 

Taus relating postdictions to criterion ratings 
were .94, .93, .92, and .71 for the final session, 
as compared to first session taus of .44, .35, 
and .27. These taus refer to the relation between 
criterion ratings and participants’ first guesses 
as to those ratings; i.e., each of these guesses 
was made when a participant was encountering 
a card he had never seen before. Our median tau 
of .925 seems favorable in relation to Williams’ 
(1959) median interrater product-moment reli- 
ability, also for commitment, of .68 (she trained 
her raters by having discussions with them), 
and with the retest reliability (tau, after a 3 
month period) of our criterion rater, which was 
also .68. We conclude that one may find par- 
ticipants who by using this technique can achieve 
a great increment in consistency in their use of 
the words “therapist commitment.” Note that 
the concept is as stable over experiments as we 
choose to make it, as it is entirely defined by a 
concrete set of criterion ratings. 

There was no relation of postdiction ac- 
curacy to either level of information or to 
(rated) level of participants’ motivation. 
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DEGREE OF AGREEMENT AMONG THREE “INTERPERSONAL 
SYSTEM” MEASURES * 


MALCOLM D. GYNTHER 
Washington University Medical School 


Leary (1957) states that public interpersonal 
behavior (Level I) is measured by the MMPI 
and sociometric ratings and that private self- 
description (Level II) is measured by the 
Interpersonal Check List (ICL). He has found 
significant positive relations between MMPI data 
and sociometric ratings of behavior of patients 
in group psychotherapy, but has not compared 
self-ratings with the other measures. The aim 
of this study is to determine the agreement 
among interpersonal circle classifications of non- 
psychiatric subjects obtained by these three as- 
sessment techniques. 

Sixty-seven male subjects and 33 female sub- 
jects, graduate psychology students (N =8), 
five groups of clinical pastoral trainees (N = 30), 
and four classes of college extension students 
(N = 62), participated. Each group was small 
enough to allow considerable interaction. Both 
sexes were used as previous work had shown no 
significant differences referable to sex between 
octant distributions of MMPIs and ICLs for 
nonpsychiatric subjects. Subjects were given 
MMPIs and ICLs about 1 month after their 
classes began, with instructions to rate self and 
the other members of their group on the ICL. 
Groups larger than eight were divided so that no 
more than eight subjects would be in any one 
rating group. These groups ranged in size from 
four to eight, with a median of six members. 

MMPI standard scores were converted into 
Dominance (Dom) and Love (Lov) scores by 
means of the following formulae: Dom =(Ma — 
D)+(Hs — Pt) and Lov =(K — F)+(Hy— Sc). 
These indices determine the octant rating. Similar 


1An extended report of this study may be ob- 
tained without charge from Malcolm D. Gynther 
(1420 Grattan Street; St. Louis 4, Missouri) or for 
a fee from the American Documentation Institute. 
Order Document No. 6936 from ADI Auxiliary Pub- 
lications Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in 
advance $1.25 for microfilm or $1.25 for photocopies. 
Make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


procedures yielded octant ratings of ICL data. 
Sociometric ratings were obtained by adding the 
Dom and Lov scores given to a subject by the 
other subjects in his group, dividing the totals 
by the number of “others,” and determining an 
octant rating for the resulting values. Agreement 
between different measures was considered to 
exist if the different ratings fell in the same 
octant, a more rigorous criterion than Leary 
uses. 

Most of the ratings fell in what Leary has 
described as the “healthy” octants (i.e., 2, 1, 8, 
7). Agreement between ratings may be spuriously 
high due to these constricted distributions. The 
percentages of “hits” between classifications de- 
rived from MMPI and sociometric, self-descrip- 
tion and sociometric, and MMPI and self-descrip- 
tion were 26, 42, and 21, respectively. Chi square 
analysis (based on Octants 1, 8, and “other’’) of 
MMPI and sociometric data showed that agree- 
ment was not significantly different from chance. 
However, agreement between self-descriptions 
and sociometric ratings, using Octants 1, 2, and 
other categories, was highly significant (x? = 
13.89, df=4, p< .01). Analysis (based on 
Octants 1, 8, and other) of MMPI and self-rating 
data also showed a significant degree of agree- 
ment (x?= 9.52, df =4, p< .05). 

Operating within the framework of the Inter- 
personal System, one would predict more agree- 
ment between ratings presumably tapping the 
same level (MMPI and sociometric) than ratings 
supposedly tapping different levels (MMPI and 
self-description and sociometric and self-descrip- 
tion). Our contradictory findings cast doubt on 
Leary’s assertion that these tests measure speci- 
fied, distinct levels of behavior. These results also 
indicate that the positive MMP!I-sociometric 
relationship found with psychiatric subjects does 
not hold with nonpsychiatric subjects. 
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Hoyt and Baron (1959) tested the validity of 
12 presumed anxiety indicators on the Draw- 
A-Person Test (DAP) using the Manifest 
Anxiety (MA) scale as an external criterion. It 
was found that Placement (upper left hand 
corner) and Size (less than 4% inches) were 
displayed by a significantly greater proportion 
of anxious than nonanxious female psychiatric 
patients. 

The present study replicated the Hoyt-Baron 
study with a male psychiatric sample. In ad- 
dition, the validity of the potential anxiety 
indicators on the DAP was tested against a 
second anxiety criterion; the Rorschach Content 
Test (RCT) developed by Elizur (1949). The 
relationship of each DAP index to intelligence 
was also estimated. 

The drawings of 123 subjects were scored for 
presence or absence of each drawing character- 
istic following the Hoyt-Baron scoring system. 
Scoring reliabilities ranged from .84 to 1.00. 
Point biserial correlations were computed be- 
tween each DAP anxiety index and MA scale 
scores, RCT scores, and intelligence (as measured 
by the Verbal WAIS). In addition, chi-square 
analyses of differences in proportions of anxious 
and nonanxious subjects displaying each index 
were computed using both the MA scale and 
RCT as bases for forming groups. 

There was substantial agreement between the 
results of the Hoyt-Baron study and the present 


1An extended report of this study may be ob- 
tained without charge from Robert E. Mogar (Mt. 
Zion Psychiatric Clinic; 2255 Post Street; San 
Francisco 15, California) or for a fee from the 
American Documentation Institute. Order Docu- 
ment No. 6934 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


ANXIETY INDICES IN HUMAN FIGURE DRAWINGS: 


A REPLICATION AND EXTENSION ! 
ROBERT E. MOGAR 


Pennsylvania State University 


replication in that little relationship was found 
between the DAP anxiety indices and the MA 
scale. Only Relative Size of Head to Figure 
(large) was significantly related to MA scale 
scores (r= .17). Contrary to Hoyt and Baron, 
neither Placement nor Size significantly differenti- 
ated extreme MA groups. These disparate results 
seemed primarily a function of the sex of the 
samples employed. 

The expectation that the RCT would provide 
a more appropriate criterion of clinical anxiety 
than the MA scale was supported by the finding 
that size (r = .26), Relative Size of Head to Fig- 
ure (r=.24), Omissions (r=.18), and Hair 
Shading (r=.14) were significantly related to 
RCT scores. A_ negative relationship was 
found between Reinforcement and RCT scores 


(r= -—.31). Intelligence was significantly re- 
lated to Erasing (r= .21) and Relative Size of 
Head to Figure (r = —.17). 


These results indicated that the MA scale 
does not measure anxiety along the same con- 
tinuum as the RCT and DAP. The lack of 
relationship between MA and RCT scores 
(r= .07) supports this conclusion. The present 
study also suggests that indirect measures of 
anxiety such as the RCT which can be reliably 
quantified may be less convenient than paper- 
and-pencil tests but perhaps more conceptually 
and clinically useful. 
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From four years of intensive 
research ... new and valuable 
information on 


THE 
PSYCHOTHERAPY 
RELATIONSHIP 


By William U. Snyder, 
The Pennsylvania State University 
With the collaboration of B. June Snyder 


A four-year research project has pro- 
duced this valuable book, which offers 
an intensive study of the psychotherapy 
relationship. The author presents exten- 
sive case studies, demonstrating the 
methods employed and focusing upon 
the dynamic relationship between thera- 
pist and client. Dr. Snyder’s eclectic 
approach places strong emphasis on the 
interpersonal relationship formed during 
counseling and therapy, and demonstrates 
techniques and skills that are unusually 
flexible and effective. The Psychotherapy 
Relationship is a basic text for courses in 
psychotherapy and a supplementary text 


and reference for courses in Personality 
and Abnormal Psychology. Some of the 
features that will make you want to have 
a copy for your personal use are: 


© Correlated research and question- 
naire data for 550 recorded 
interviews. 

© Discussion of the use of relationship 
dreams in psychotherapy. 

© Illustration and analysis of thera- 
peutic process. 

® Many examples of positive and 
negative transference and of the 
therapist’s countertransference. 

© Thorough examination of one case 
study and selections from 19 others. 


EXAMINE THIS INFORMATIVE NEW STUDY FOR 
TEN DAYS WITHOUT OBLIGATION. 


The Macmillan Company, 60 Fifth Avenue, 
New York 11, N. Y. Box JC-1 


Please send me a copy of Snyder: THE PSYCHO- 
THERAPY RELATIONSHIP @ $7.50 
Il may return it in ten days without cost or 
obligation. Otherwise bill me. (Save! Send payment 
with your order and we pay for delivery.) 

Payment enclosed Bill 
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COUNSELING FOR PERSONAL ADJUSTMENT 
In Schools and Colleges 
Fred McKinney 


This text for the first course in counseling concentrates on certain 
basic principles and techniques, with special emphasis on the nature 
of personality structure and dynamics. Sixty case studies illustrate 
counseling in an educational context. 


584 pages 1958 $6.75 


CLIENT-CENTERED THERAPY 
Carl R. Rogers 


A mature presentation of the non-directive and related points of view 


in counseling and therapy, this volume orients the professional public 
to client-centered therapy and details its applications and implications 
to several fields, including work with children and groups. 


560 pages 1951 75 


Houghton Mifflin 


BOSTON: NEW YORK: ATLANTA 
GENEVA. ILL- DALLAS - PALO ALTO. 
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Tall Oaks From Little Acorns Grew 


Through the ages, man has held a 
tiny acorn in his hand and envisioned 
the future giant of the forest. 


What laws of nature must operate 
to assure the transition from insig- 
nificance to full growth? 


Placement in fertile soil; protec- 
tion of the life essence ; nourishment 
from natural elements; time to de- 
velop into the type of tree nature 
intended — these combine to form 
the right environment. 


So life’s power in individuals 
comes to its full realization through 
proper conditions working to nur- 
ture innate potentials. 


Aware of latent possibilities in 
each person entrusted to its care, 
Devereux Schools—now celebrating 
its 50th Anniversary — has grown 
from a very small beginning to serve 
today more than a thousand children 


Dawid Everett 


and young adults in schools, commu- 
nities, and camps in four states. 
Through the past half century, it has 
provided the right environment for 
thousands of mentally retarded and 
emotionally disturbed children who 
have benefitted from it and returned 
to their families and communities. 


Thus, from the acorn of an idea 
has grown a stalwart tree .... 
THE DEVEREUX FOUNDATION. 


Your inquiries are invited. For 
further information and literature, 
East Coast residents may contact Mr. 
Charles J. Fowler, Director of Ad- 
missions, Devereux Schools, Devon, 
Pa. Those in the West Coast area, 
Mr. Keith A. Seaton, Registrar, 
Devereux Schools in California, 
Santa Barbara, Calif. Southwestern 
residents, Mr. Richard Danko, 
Executive Administrator, Devereux. 
Schools in Texas, Victoria, Texas. 


* Since the founding of Devereux Schools in 1912, a philosophy of Special 
Education and therapeutic techniques of rehabilitation and care have been 
developed for the sole reason of insuring to the greatest possible extent the 
future happiness and well-being cf the retarded or emotionally disturbed 
child. Many of these techniques originated at Devereux Schools. This series 
is launched during the Schools’ 50th Anniversary in order that you may 
better understand the methods and goals of Devereux Schools. 


The Devereux Foundation 


Helena T. Devereux, Founder and Consultont * Edward L. French, Ph.D., President and Director 
A nonprofit organization ... Devon, Penna. « Senta Barbara, Celif. « Victoria, Texas 
SCHOOLS + COMMUNITIES *« CAMPS * TRAINING « RESEARCH 
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“The best in tests. .. .” 

Could you use a projective trst elicit- 
ing a rich variety of unguarded re- 
sponses from deep layers of the 
personality? Easily and quickly ad- 
ministered? Giving results of demon- 
strated reliability and validity? For 
children as well as adults? 


The 
KAHN TEST 
SYMBOL 

ARRANGE- 
MENT 


uses simple plastic objects arranged by 
the examinee in various sequences ac- 
cording to several different sets of 
easily-followed instructions. An ex- 
cellent supplement to or substitute for 
the TAT and Rorschach, combining 
the principal advantages of both. 


Obtain information and/or order (com- 
plete set $25.00) from 


PSYCHOLOGICAL TEST 
SPECIALISTS 


Box 1441 
Missoula, Montana 


“The best in tests. . . .” 
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